ATTORNEY DOCKET NO. 015389-002610

HUMAN TELOMERASE CATALYTIC SUBUNIT 

The present application is a continuation-in-part application of U.S.
Patent Application Serial Number 08/915,503, filed August 14, 1997, and a
continuation-in-part application of U.S. Patent Application Serial Number 08/912,951,
filed August 14, 1997, and a continuation-in-part application of U.S. Patent Application
Serial Number 08/911,312, filed August 14, 1997, all three of which are
continuation-in-part applications of U.S. Patent Application Serial Number 08/854,050,
filed May 9, 1997, which is a continuation-in-part application of U.S. Patent Application
Serial Number 08/851,843, filed May 6, 1997, which is a continuation-in-part application of
U.S. Patent Application Serial Number 08/846,017, filed April 25, 1997, which is a
continuation-in-part application of U.S. Patent Application Serial Number 08/844,419,
filed April 19, 1996, which is a continuation-in-part application of U.S. Patent
Application Serial Number 08/724,643, filed October 1, 1996. This application also
claims priority to Patent Cooperation Treaty Patent Application Serial No.:
PCT/US97/17885 and to Patent Cooperation Treaty Patent Application Serial No.:
PCT/US97/17618, both filed in the U.S. Receiving Office on October 1, 1997. Each of
the aforementioned applications is explicitly incorporated herein by reference in its
entirety and for all purposes. This application also incorporates by reference copending
U.S. Patent Application Serial Number ["Telomerase Reverse
Transcriptase," attorney docket no. 015389-002950] filed November 19, 1997, in its
entirety and for all purposes.

This invention was made with Government support under Grant No.
GM28039, awarded by the National Institutes of Health. The Government has certain
rights in this invention.

FIELD OF THE INVENTION 

The present invention is related to novel nucleic acids encoding the
catalytic subunit of telomerase and related polypeptides. In particular, the present
invention is directed to the catalytic subunit of human telomerase. The invention
provides methods and compositions relating to medicine, molecular biology, chemistry,
pharmacology, and medical diagnostic and prognostic technology.

BACKGROUND OF THE INVENTION

The following discussion is intended to introduce the field of the present 
invention to the reader. The citation of various references in this section is not to be 
construed as an admission of prior invention. 

It has long been recognized that complete replication of the ends of
eukaryotic chromosomes requires specialized cell components (Watson, 1972, Nature
New Biol. 239:197; Olovnikov, 1973, J. Theor. Biol. 41:181). Replication of a linear
DNA strand by conventional DNA polymerases requires an RNA primer, and can
proceed only 5' to 3'. When the RNA bound at the extreme 5' ends of eukaryotic
chromosomal DNA strands is removed, a gap is introduced, leading to a progressive
shortening of daughter strands with each round of replication. This shortening of
telomeres, the protein-DNA structures physically located on the ends of chromosomes,
is thought to account for the phenomenon of cellular senescence or aging (see, e.g.,
Goldstein, 1990, Science 249:1129; Martin et al., 1979, Lab. Invest. 23:86; Goldstein et
al., 1969, Proc. Natl. Acad. Sci. USA 64:155; and Schneider and Mitsui, 1976, Proc.
Natl. Acad. Sci. USA 73:3584) of normal human somatic cells in vitro and in vivo.

The length and integrity of telomeres are thus related to entry of a cell
into a senescent stage (i.e., loss of proliferative capacity). Moreover, the ability of a
cell to maintain (or increase) telomere length may allow a cell to escape senescence,
i.e., to become immortal.

The structure of telomeres and telomeric DNA has been investigated in
numerous systems (see, e.g., Harley and Villeponteau, 1995, Curr. Opin. Genet. Dev.
5:249). In most organisms, telomeric DNA consists of a tandem array of very simple
sequences; in humans and other vertebrates telomeric DNA consists of hundreds to
thousands of tandem repeats of the sequence TTAGGG. Methods for determining and
modulating telomere length in cells are described in PCT Publications WO 93/23572
and WO 96/41016.
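
The tandem-repeat structure described above can be illustrated with a short sketch. This is only an illustration, assuming a plain DNA string as input; the function name `longest_tandem_run` and the toy sequence are hypothetical and are not taken from the cited publications, which describe laboratory methods rather than software.

```python
import re

# Vertebrate telomeric repeat unit named in the text.
TELOMERE_REPEAT = "TTAGGG"


def longest_tandem_run(seq: str, unit: str = TELOMERE_REPEAT) -> int:
    """Return the largest number of back-to-back copies of `unit` found in `seq`."""
    pattern = re.compile(f"(?:{re.escape(unit)})+")
    runs = [len(m.group(0)) // len(unit) for m in pattern.finditer(seq.upper())]
    return max(runs, default=0)


# Hypothetical toy sequence: four repeats, a spacer, then two repeats.
toy = "ACGT" + "TTAGGG" * 4 + "CCC" + "TTAGGG" * 2
print(longest_tandem_run(toy))
```

The sketch counts only perfect, uninterrupted copies of the repeat; real telomeric DNA spans hundreds to thousands of repeats and would normally be measured experimentally.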

The maintenance of telomeres is a function of a telomere-specific DNA
polymerase known as telomerase. Telomerase is a ribonucleoprotein (RNP) that uses a
portion of its RNA moiety as a template for telomere repeat DNA synthesis (Morin,
1997, Eur. J. Cancer 33:750; Yu et al., 1990, Nature 344:126; Singer and Gottschling,
1994, Science 266:404; Autexier and Greider, 1994, Genes Develop., 8:563; Gilley et
al., 1995, Genes Develop., 9:2214; McEachern and Blackburn, 1995, Nature 367:403;
Blackburn, 1992, Ann. Rev. Biochem., 61:113; Greider, 1996, Ann. Rev. Biochem.,
65:337). The RNA components of human and other telomerases have been cloned and
characterized (see, PCT Publication WO 96/01835 and Feng et al., 1995, Science
269:1236). However, the characterization of the protein components of telomerase has
been difficult. In part, this is because it has proved difficult to purify the telomerase
RNP, which is present in extremely low levels in cells in which it is expressed. For
example, it has been estimated that human cells known to express high levels of
telomerase activity may have only about one hundred molecules of the enzyme per cell.

Consistent with the relationship of telomeres and telomerase to the
proliferative capacity of a cell (i.e., the ability of the cell to divide indefinitely),
telomerase activity is detected in immortal cell lines and an extraordinarily diverse set
of tumor tissues, but is not detected (i.e., was absent or below the assay threshold) in
normal somatic cell cultures or normal tissues adjacent to a tumor (see, U.S. Patent
Nos. 5,629,154; 5,489,508; 5,648,215; and 5,639,613; see also, Morin, 1989, Cell
59:521; Shay and Bacchetti 1997, Eur. J. Cancer 33:787; Kim et al., 1994, Science
266:2011; Counter et al., 1992, EMBO J. 11:1921; Counter et al., 1994, Proc. Natl.
Acad. Sci. U.S.A. 91:2900; Counter et al., 1994, J. Virol. 68:3410). Moreover, a
correlation between the level of telomerase activity in a tumor and the likely clinical
outcome of the patient has been reported (e.g., U.S. Patent No. 5,639,613, supra;
Langford et al., 1997, Hum. Pathol. 28:416). Telomerase activity has also been
detected in human germ cells, proliferating stem or progenitor cells, and activated
lymphocytes. In somatic stem or progenitor cells, and in activated lymphocytes,
telomerase activity is typically either very low or only transiently expressed (see, Chiu
et al., 1996, Stem Cells 14:239; Bodnar et al., 1996, Exp. Cell Res. 228:58; Taylor et
al., 1996, J. Invest. Dermatology 106:759).

Human telomerase is an ideal target for diagnosing and treating human
diseases relating to cellular proliferation and senescence, such as cancer. Methods for
diagnosing and treating cancer and other telomerase-related diseases in humans are
described in U.S. Patent Nos. 5,489,508, 5,639,613, and 5,645,986. Methods for
predicting tumor progression by monitoring telomerase are described in U.S. Patent No.
5,639,613. The discovery and characterization of the catalytic protein subunit of
human telomerase would provide additional useful assays for telomerase and for
disease diagnosis and therapy. Moreover, cloning and determination of the primary
sequence of the catalytic protein subunit would allow more effective therapies for
human cancers and other diseases related to cell proliferative capacity and senescence.

BRIEF SUMMARY OF THE INVENTION 

The present invention provides an isolated, substantially pure, or
recombinant protein preparation of a telomerase reverse transcriptase protein, or a
variant thereof, or a fragment thereof. In one embodiment the protein is characterized
as having a defined motif that has an amino acid sequence:

Trp-R1-X7-R1-R1-R2-X-Phe-Phe-...
where X is any amino acid and a subscript refers to the number of consecutive residues,
R1 is leucine or isoleucine, R2 is glutamine or arginine, R3 is phenylalanine or tyrosine,
and R4 is lysine or histidine. In one embodiment the protein has a sequence of human
TRT. In other embodiments, the invention relates to peptides and polypeptides sharing
substantial sequence identity with a subsequence of such proteins.
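
A motif of this kind can be expressed as a pattern over one-letter amino acid codes. The sketch below is illustrative only and covers just the leading portion of the motif, read here as Trp-R1-X7-R1-R1-R2-X-Phe-Phe with R1 being leucine or isoleucine and R2 being glutamine or arginine; any residues that follow Phe-Phe are not modeled. The names `MOTIF_START` and `find_motif_start`, and the test peptides, are hypothetical.

```python
import re

# Leading portion of the motif in one-letter code (an assumption of this sketch):
#   Trp = W, R1 = L or I, X7 = any seven residues, R2 = Q or R, Phe = F.
MOTIF_START = re.compile(r"W[LI][A-Z]{7}[LI][LI][QR][A-Z]FF")


def find_motif_start(protein: str):
    """Return the 0-based index of the first match in `protein`, or None."""
    match = MOTIF_START.search(protein.upper())
    return match.start() if match else None


# Hypothetical peptides built only to exercise the pattern.
matching = "MK" + "W" + "L" + "AAAAAAA" + "I" + "L" + "Q" + "G" + "FF" + "TE"
non_matching = "MK" + "W" + "L" + "AAAAAAA" + "I" + "L" + "K" + "G" + "FF" + "TE"
print(find_motif_start(matching), find_motif_start(non_matching))
```

A pattern match of this sort is only a screening aid; it says nothing about whether a matching protein has telomerase activity.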

In a related embodiment the invention provides an isolated, substantially
pure or recombinant nucleic acid that encodes a telomerase reverse transcriptase
protein. In one embodiment the nucleic acid encodes a protein comprising an amino
acid sequence:

Trp-R1-X7-R1-R1-R2-X-Phe-Phe-...
In another embodiment, the nucleic acid has a sequence that encodes the human TRT
protein. In other embodiments, the invention relates to oligonucleotides and
polynucleotides sharing substantial sequence identity or complementarity with a
subsequence of such nucleic acids.

In one embodiment, the invention relates to human telomerase reverse
transcriptase (hTRT) protein. Thus, in one embodiment, the invention provides an
isolated, substantially pure, or recombinant protein preparation of an hTRT protein, or a
variant thereof, or a fragment thereof. In one embodiment, the protein is characterized
by having an amino acid sequence with at least about 75% or at least about 80%
sequence identity to the hTRT protein of Figure 17 (SEQUENCE ID NO: 2), or a
variant thereof, or a fragment thereof. In a related aspect, the hTRT protein has the
sequence of SEQUENCE ID NO: 2. In some embodiments, the protein has one or
more telomerase activities, such as catalytic activity. In one embodiment, the hTRT
protein fragment has at least 6 amino acid residues. In other embodiments, the hTRT
protein fragment has at least 8, at least about 10, at least about 12, at least about 15 or at
least about 20 contiguous amino acid residues of a naturally occurring hTRT
polypeptide. In still other embodiments, the hTRT protein fragment has at least about
50 or at least about 100 amino acid residues.
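
The percent sequence identity thresholds mentioned above can be illustrated with a small calculation. This is a minimal sketch under stated assumptions: the two sequences are already aligned to equal length with "-" marking gaps, and identity is taken as matching columns divided by all aligned columns. Other definitions of identity exist, and this text does not say which one applies. The function name `percent_identity` and the toy sequences are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent of alignment columns in which both sequences carry the same residue.

    Assumes the inputs are pre-aligned (equal length, '-' for gaps).
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [
        (a, b) for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == "-" and b == "-")
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != "-")
    return 100.0 * matches / len(columns)


# Hypothetical toy alignment: 6 identical columns out of 8.
print(percent_identity("MKVLAG-T", "MKILAGST"))
```

With this toy alignment, the result sits exactly at a 75% threshold, which shows how sensitive such a cutoff is to the treatment of gaps and to the alignment itself.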

The invention also provides a composition comprising an hTRT protein
and an RNA. The RNA may be a telomerase RNA, such as a human telomerase RNA.
In one embodiment, the hTRT protein and the human telomerase RNA (hTR) form a
ribonucleoprotein complex with a telomerase activity.

In one embodiment, the invention provides isolated human telomerase
comprising hTRT protein, such as a substantially pure human telomerase comprising
hTRT protein and comprising hTR. In one embodiment, the telomerase is at least about
95% pure. The telomerase may be isolated from a cell, such as a recombinant host cell
or a cell that expresses telomerase activity.

In another aspect, the invention provides an isolated, synthetic,
substantially pure, or recombinant polynucleotide comprising a nucleic acid sequence
that encodes an hTRT protein. In one embodiment, the polynucleotide has a nucleotide
sequence encoding an hTRT protein that has an amino acid sequence as set forth in
Figure 17 (SEQUENCE ID NO: 2) or a sequence that comprises one or more
conservative amino acid (or codon) substitutions or one or more activity-altering amino
acid (or codon) substitutions in said amino acid sequence. In a related aspect, the
polynucleotide hybridizes under stringent conditions to a polynucleotide having the
sequence as set forth in Figure 16 (SEQUENCE ID NO: 1). In another related aspect,
the nucleotide sequence of the polynucleotide has a smallest sum probability of less
than about 0.5 when compared to a nucleotide sequence as set forth in Figure 16
(SEQUENCE ID NO: 1) using the BLAST algorithm with default parameters.

In another aspect, the invention provides a polynucleotide having a
promoter sequence operably linked to the sequence encoding the hTRT protein. The
promoter may be a promoter other than the naturally occurring hTRT promoter. In a
related aspect, the invention provides an expression vector comprising the promoter of
the hTRT.

The invention also provides an isolated, synthetic, substantially pure, or
recombinant polynucleotide that is at least ten nucleotides in length and comprises a
contiguous sequence of at least ten nucleotides that is identical or exactly
complementary to a contiguous sequence in a naturally occurring hTRT gene or hTRT
mRNA. In some embodiments the polynucleotide is an RNA, a DNA, or contains one
or more non-naturally occurring, synthetic nucleotides. In one aspect, the
polynucleotide is identical or exactly complementary to the contiguous sequence of at
least ten contiguous nucleotides in a naturally occurring hTRT gene or hTRT mRNA.
For example, the polynucleotide may be an antisense polynucleotide. In one
embodiment, the antisense polynucleotide comprises at least about 20 nucleotides.
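
The "identical or exactly complementary" criterion over at least ten contiguous nucleotides can be sketched as a string check. This is an illustrative sketch only: it handles the four standard bases (with U treated as T), the helper names are hypothetical, and `TOY_TARGET` is an invented sequence, not an hTRT sequence.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Reverse complement of a DNA string (U is treated as T)."""
    return dna.upper().replace("U", "T").translate(_COMPLEMENT)[::-1]


def _windows(seq: str, k: int) -> set:
    """All contiguous substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def has_exact_stretch(probe: str, target: str, min_len: int = 10) -> bool:
    """True if `probe` contains at least `min_len` contiguous nucleotides that are
    identical to, or exactly complementary to, a contiguous stretch of `target`."""
    probe = probe.upper().replace("U", "T")
    target = target.upper().replace("U", "T")
    if len(probe) < min_len or len(target) < min_len:
        return False
    target_windows = _windows(target, min_len) | _windows(reverse_complement(target), min_len)
    return any(window in target_windows for window in _windows(probe, min_len))


# Hypothetical toy target, invented for illustration.
TOY_TARGET = "ATGCCGTACGTTAGCAGGCTAACG"
sense_probe = "GGG" + TOY_TARGET[5:15] + "AAA"           # carries an identical 10-mer
antisense_probe = reverse_complement(TOY_TARGET[3:14])   # exactly complementary 11-mer
print(has_exact_stretch(sense_probe, TOY_TARGET),
      has_exact_stretch(antisense_probe, TOY_TARGET))
```

Checking both the target and its reverse complement covers sense and antisense probes in one pass. The sketch does not model non-naturally occurring nucleotides or hybridization conditions.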

The invention further provides a method of preparing recombinant
telomerase by contacting a recombinant hTRT protein with a telomerase RNA
component under conditions such that said recombinant protein and said telomerase
RNA component associate to form a telomerase enzyme capable of catalyzing the
addition of nucleotides to a telomerase substrate. In one embodiment, the hTRT protein
has a sequence as set forth in Figure 17 (SEQUENCE ID NO: 2). The hTRT protein
may be produced in an in vitro expression system and mixed with a telomerase RNA
or, in another embodiment, the telomerase RNA can be co-expressed in the in vitro
expression system. In one embodiment the telomerase RNA is hTR. In an alternative
embodiment, the contacting occurs in a cell, such as a human cell. In one embodiment,
the cell does not have telomerase activity prior to the contacting of the hTRT and the
RNA, or the introduction, such as by transfection, of an hTRT polynucleotide. In one
embodiment, the telomerase RNA is expressed naturally by said cell.

The invention also provides a cell, such as a human, mouse, or yeast
cell, containing the recombinant polynucleotides of the invention such as a
polynucleotide with an hTRT protein coding sequence operably linked to a promoter. In
particular aspects, the cell is a vertebrate cell, such as a cell from a mammal, for
example a human, and has an increased proliferative capacity relative to a cell that is
otherwise identical but does not comprise the recombinant polynucleotide or has an
increased telomerase activity level relative to a cell that is otherwise identical but does
not comprise the recombinant polynucleotide. In some embodiments the cell is
immortal.

In related embodiments, the invention provides organisms and cells
comprising a polynucleotide encoding a human telomerase reverse transcriptase
polypeptide, such as a transgenic non-human organism such as a yeast, plant,
bacterium, or a non-human animal, for example, a mouse. The invention also provides
for transgenic animals and cells from which an hTRT gene has been deleted
(knocked-out) or mutated such that the gene does not express a naturally occurring hTRT
gene product. Thus, in alternative embodiments, the transgenic non-human animal has a
mutated telomerase gene, is an animal deficient in a telomerase activity, is an animal
whose TRT deficiency is a result of a mutated gene encoding a TRT having a reduced
level of a telomerase activity compared to a wild-type TRT, or is an animal having a
mutated TRT gene with one or more mutations, including missense mutations,
nonsense mutations, insertions, or deletions.

The invention also provides an isolated or recombinant antibody, or
fragment thereof, that specifically binds to an hTRT protein. In one embodiment, the
antibody binds with an affinity of at least about 10⁸ M⁻¹. The antibody may be
monoclonal or may be a polyclonal composition, such as a polyclonal antiserum. In a
related aspect, the invention provides a cell capable of secreting the antibody, such as a
hybridoma.

The invention also provides a method for determining whether a
compound or treatment is a modulator of a telomerase reverse transcriptase activity or
hTRT expression. The method involves detecting or monitoring a change in activity or
expression in a cell, animal or composition comprising an hTRT protein or
polynucleotide following administration of the compound or treatment. In one
embodiment, the method includes the steps of: providing a TRT composition,
contacting the TRT with the test compound and measuring the activity of the TRT,
where a change in TRT activity in the presence of the test compound is an indicator
that the test compound modulates TRT activity. In certain embodiments, the
composition is a cell, an organism, a transgenic organism or an in vitro system, such as
an expression system, which contains a recombinant polynucleotide encoding an hTRT
polypeptide. Thus, the hTRT of the method may be a product of in vitro expression. In
various embodiments the detection of telomerase activity or expression may be by
detecting a change in abundance of an hTRT gene product, monitoring incorporation of
a nucleotide label into a substrate for telomerase, monitoring hybridization of a probe to
an extended telomerase substrate, monitoring amplification of an extended telomerase
substrate, monitoring telomere length of a cell exposed to the test compound,
monitoring the loss of the ability of the telomerase to bind to a chromosome, or
measuring the accumulation or loss of telomere structure.

In one aspect, the invention provides a method of detecting an hTRT
gene product in a biological sample by contacting the biological sample with a probe
that specifically binds the gene product, wherein the probe and the gene product form a
complex, and detecting the complex, where the presence of the complex is correlated
with the presence of the hTRT gene product in the biological sample. The gene product
may be RNA, DNA or a polypeptide. Examples of probes that may be used for
detection include, but are not limited to, nucleic acids and antibodies.

In one embodiment, the gene product is a nucleic acid which is detected
by amplifying the gene and detecting the amplification product, where the presence of
the complex or amplification product is correlated with the presence of the hTRT gene
product in the biological sample.

In one embodiment, the biological sample is from a patient, such as a
human patient. In another embodiment the biological sample includes at least one cell
from an in vitro cell culture, such as a human cell culture.

The invention further provides a method of detecting the presence of at
least one immortal or telomerase positive human cell in a biological sample comprising
human cells by obtaining the biological sample comprising human cells; and detecting
the presence in the sample of a cell having a high level of an hTRT gene product, where
the presence of a cell having a high level of the hTRT gene product is correlated with
the presence of immortal or telomerase positive cells in the biological sample.

The invention also provides a method for diagnosing a telomerase-related
condition in a patient by obtaining a cell or tissue sample from the patient;
determining the amount of an hTRT gene product in the cell or tissue; and comparing
the amount of hTRT gene product in the cell or tissue with the amount in a healthy cell
or tissue of the same type, where a different amount of hTRT gene product in the
sample from the patient and the healthy cell or tissue is diagnostic of a
telomerase-related condition. In one embodiment the telomerase-related condition is
cancer and a greater amount of hTRT gene product is detected in the sample.

The invention further provides a method of diagnosing cancer in a
patient by obtaining a biological sample from the patient, and detecting an hTRT gene
product in the patient sample, where the detection of the hTRT gene product in the
sample is correlated with a diagnosis of cancer.

The invention further provides a method of diagnosing cancer in a
patient by obtaining a patient sample; determining the amount of hTRT gene product in
the patient sample; and comparing the amount of hTRT gene product with a normal or
control value, where an amount of the hTRT gene product in the patient that is greater
than the normal or control value is diagnostic of cancer.

The invention also provides a method of diagnosing cancer in a patient
by obtaining a patient sample containing at least one cell; determining the amount of
an hTRT gene product in a cell in the sample; and comparing the amount of hTRT gene
product in the cell with a normal value for the cell, wherein an amount of the hTRT
gene product greater than the normal value is diagnostic of cancer. In one embodiment,
the sample is believed to contain at least one malignant cell.

The invention also provides a method for prognosing a cancer patient
by determining the amount of hTRT gene product in a cancer cell obtained from the
patient; and comparing the amount of hTRT in the cancer cell with a prognostic value
of hTRT consistent with a prognosis for the cancer; where an amount of hTRT in the
sample that is at the prognostic value provides the particular prognosis.

The invention also provides a method for monitoring the ability of an
anticancer treatment to reduce the proliferative capacity of cancer cells in a patient, by
making a first measurement of the amount of an hTRT gene product in at least one
cancer cell from the patient; making a second measurement of the level of the hTRT
gene product in at least one cancer cell from the patient, wherein the anticancer
treatment is administered to the patient before the second measurement; and comparing
the first and second measurements, where a lower level of the hTRT gene product in the
second measurement is correlated with the ability of an anticancer treatment to reduce
the proliferative capacity of cancer cells in the patient.

The invention also provides kits for the detection of an hTRT gene or
gene product. In one embodiment, the kit includes a container including a molecule
selected from an hTRT nucleic acid or subsequence thereof, an hTRT polypeptide or
subsequence thereof, and an anti-hTRT antibody.

The invention also provides methods of treating human diseases. In one
embodiment, the invention provides a method for increasing the proliferative capacity
of a vertebrate cell, such as a mammalian cell, by introducing a recombinant
polynucleotide into the cell, wherein said polynucleotide comprises a sequence
encoding an hTRT polypeptide. In one embodiment, the hTRT polypeptide has a
sequence as shown in Figure 17. In one embodiment, the sequence is operably linked
to a promoter. In one embodiment, the hTRT has telomerase catalytic activity. In one
embodiment, the cell is human, such as a cell in a human patient. In an alternative
embodiment, the cell is cultured in vitro. In a related embodiment, the cell is
introduced into a human patient.

The invention further provides a method for treating a human disease by
introducing recombinant hTRT polynucleotide into at least one cell in a patient. In one
embodiment, a gene therapy vector is used. In a related embodiment, the method
further consists of introducing into the cell a polynucleotide comprising a sequence
encoding hTR, for example, an hTR polynucleotide operably linked to a promoter.

The invention also provides a method for increasing the proliferative
capacity of a vertebrate cell, said method comprising introducing into the cell an
effective amount of hTRT polypeptide. In one embodiment the hTRT polypeptide has
telomerase catalytic activity. The invention further provides cells and cell progeny with
increased proliferative capacity.

The invention also provides a method for treating a condition associated
with an elevated level of telomerase activity within a cell, comprising introducing into
said cell a therapeutically effective amount of an inhibitor of said telomerase activity,
wherein said inhibitor is an hTRT polypeptide or an hTRT polynucleotide. In one
embodiment, the inhibitor is a polypeptide or polynucleotide comprising, e.g., at least a
subsequence of a sequence shown in Figures 16, 17, or 20. In additional embodiments,
the polypeptide or polynucleotide inhibits a TRT activity, such as binding of
endogenous TRT to telomerase RNA.

The invention also provides a vaccine comprising an hTRT polypeptide
and an adjuvant. The invention also provides pharmacological compositions containing
a pharmaceutically acceptable carrier and a molecule selected from: an hTRT
polypeptide, a polynucleotide encoding an hTRT polypeptide, and an hTRT nucleic
acid or subsequence thereof.

DESCRIPTION OF THE FIGURES 

Figure 1 shows highly conserved residues in TRT motifs from human, S.
pombe (tez1), S. cerevisiae (EST2) and Euplotes aediculatus (p123). Identical amino
acids are indicated with an asterisk (*) [raised slightly], while the similar amino acid
residues are indicated by a dot (•). Motif "0" in the figure is also called Motif T; Motif
"3" is also called Motif A.

Figure 2 shows the location of telomerase-specific and RT-specific
sequence motifs of telomerase proteins and other reverse transcriptases. Locations of
telomerase-specific motif T and conserved RT motifs 1, 2 and A-E are indicated by
boxes. The open rectangle labeled HIV-1 RT delineates the portion of this protein
shown in Figure 3.

Figure 3 shows the crystal structure of the p66 subunit of HIV-1 reverse
transcriptase (Brookhaven code 1HNV). The view is from the back of the right hand to
enable all motifs to be shown.

Figure 4 shows multiple sequence alignment of telomerase RTs
(Sp_Trt1p, S. pombe TRT [also referred to herein as "tez1p"]; hTRT, human TRT;
Ea_p123, Euplotes p123; Sc_Est2p, S. cerevisiae Est2p) and members of other RT
families (Sc_a1, cytochrome oxidase group II intron 1-encoded protein from S.
cerevisiae mitochondria; Dm_TART, reverse transcriptase from Drosophila
melanogaster TART non-LTR retrotransposable element; HIV-1, human
immunodeficiency virus reverse transcriptase). TRT con and RT con represent
consensus sequences for telomerase RTs and non-telomerase RTs. Amino acids are
designated with an h, hydrophobic; p, polar; c, charged. Triangles show residues that
are conserved among telomerase proteins but different in other RTs. The solid line
below motif E highlights the primer grip region.

Figure 5 shows expression of hTRT RNA in telomerase-negative mortal
cell strains and telomerase-positive immortal cell lines as described in Example 2.

Figure 6 shows a possible phylogenetic tree of telomerases and
retroelements rooted with RNA-dependent RNA polymerases.

Figure 7 shows a restriction map of lambda clone Gφ5.

Figure 8 shows a map of chromosome 5p with the location of the STS
marker D5S678 (located near the hTRT gene) indicated.

Figure 9 shows the construction of an hTRT promoter-reporter plasmid.

Figure 10, in two pages, shows coexpression in vitro of hTRT and hTR
to produce catalytically active human telomerase.

Figure 11, in two pages, shows an alignment of sequences from four
TRT proteins and identifies motifs of interest. TRT con shows a TRT consensus
sequence. RT con shows consensus residues for other reverse transcriptases.
Consensus residues in upper case indicate absolute conservation in TRT proteins.

Figure 12 shows a Topoisomerase II cleavage site and NFkB binding
site motifs in an hTRT intron, with the sequence shown corresponding to SEQUENCE
ID NO: 7.

Figure 13, in two pages, shows the sequence of the DNA encoding the
Euplotes 123 kDa telomerase protein subunit (Euplotes TRT).

Figure 14 shows the amino acid sequence of the Euplotes 123 kDa
telomerase protein subunit (Euplotes TRT protein).

Figure 15, in five pages, shows the DNA and amino acid sequences of
the S. pombe telomerase catalytic subunit (S. pombe TRT).

Figure 16, in two pages, shows the hTRT cDNA sequence, with the
sequence shown corresponding to SEQUENCE ID NO: 1.

Figure 17 shows the hTRT protein encoded by the cDNA of Figure 16. 
The protein sequence shown corresponds to SEQUENCE ID NO: 2. 

Figure 18 shows the sequence of clone 712562, with the sequence
shown corresponding to SEQUENCE ID NO: 3.

Figure 19 shows a 259 residue protein encoded by clone 712562, with 
the sequence shown corresponding to SEQUENCE ID NO: 10. 

Figure 20 shows, in seven pages, the sequence of a nucleic acid with an
open reading frame encoding a Δ182 variant polypeptide, with the sequence shown
corresponding to SEQUENCE ID NO: 4. This Figure also shows the amino acid
sequence of this Δ182 variant polypeptide, with the amino acid sequence shown
corresponding to SEQUENCE ID NO: 5.

Figure 21 shows, in six pages, sequence from an hTRT genomic clone,
with the sequence shown corresponding to SEQUENCE ID NO: 6. Consensus motifs
and elements are indicated, including sequences characteristic of a topoisomerase II
cleavage site, NFkB binding sites, an Alu sequence and other sequence elements.

Figure 22 shows the effect of mutation of the TRT gene in yeast, as 
described in Example 1. 

Figure 23 shows the sequence of EST AA281296, corresponding to
SEQUENCE ID NO: 8.

Figure 24 shows the sequence of the 182 basepairs deleted in clone
712562, with the sequence shown corresponding to SEQUENCE ID NO: 9.

Figure 25 shows the results of an assay for telomerase activity from BJ
cells transfected with an expression vector encoding an hTRT protein (pGRN133) or a
control plasmid (pBBS212) as described in Example 13.

Figure 26 is a schematic diagram of the affinity purification of
telomerase showing the binding and displacement elution steps.

Figure 27 is a photograph of a Northern blot of telomerase preparations
obtained during a purification protocol, as described in Example 1. Lane 1 contained
1.5 fmol telomerase RNA, lane 2 contained 4.6 fmol telomerase RNA, lane 3 contained
14 fmol telomerase RNA, lane 4 contained 41 fmol telomerase RNA, lane 5 contained
nuclear extract (42 fmol telomerase), lane 6 contained Affi-Gel-heparin-purified
telomerase (47 fmol telomerase), lane 7 contained affinity-purified telomerase (68
fmol), and lane 8 contained glycerol gradient-purified telomerase (35 fmol).

Figure 28 shows telomerase activity through a purification protocol.

Figure 29 is a photograph of an SDS-PAGE gel, showing the presence of
an approximately 123 kDa polypeptide and an approximately 43 kDa doublet from
Euplotes aediculatus.

Figure 30 is a graph showing the sedimentation coefficient of Euplotes
aediculatus telomerase.

Figure 31 is a photograph of a polyacrylamide/urea gel with 36%
formamide showing the substrate utilization of Euplotes telomerase.

Figure 32 shows the putative alignments of telomerase RNA template,
and hairpin primers with telomerase RNA.

Figure 33 is a photograph of lanes 25-30 of the gel shown in Figure 31,
shown at a lighter exposure level.

Figure 34 shows the DNA sequence of the gene encoding the 43 kDa
telomerase protein subunit from Euplotes.

Figure 35 shows, in four pages, the DNA sequence, as well as the amino
acid sequences of all three open reading frames of the 43 kDa telomerase protein
subunit from Euplotes.

Figure 36 shows a sequence comparison between the 123 kDa
telomerase protein subunit of Euplotes (upper sequence) and the 80 kDa polypeptide
subunit of T. thermophila (lower sequence).

Figure 37 shows a sequence comparison between the 123 kDa
telomerase protein subunit of E. aediculatus (upper sequence) and the 95 kDa
telomerase polypeptide of T. thermophila (lower sequence).

Figure 38 shows the best-fit alignment between a portion of the "La-
domain" of the 43 kDa telomerase protein subunit of E. aediculatus (upper sequence)
and a portion of the 95 kDa polypeptide subunit of T. thermophila (lower sequence).

Figure 39 shows the best-fit alignment between a portion of the "La-
domain" of the 43 kDa telomerase protein subunit of E. aediculatus (upper sequence)
and a portion of the 80 kDa polypeptide subunit of T. thermophila (lower sequence).

Figure 40 shows the alignment and motifs of the polymerase domain of
the 123 kDa telomerase protein subunit of E. aediculatus and the polymerase domains
of various reverse transcriptases including a cytochrome oxidase group II intron
1-encoded protein from S. cerevisiae mitochondria (a1 S.c. (group II)), Dong (LINE),
and yeast ESTp (L8543.12).

Figure 41 shows the alignment of a domain of the 43 kDa telomerase 
protein subunit with various La proteins. 

Figure 42 shows the nucleotide sequence encoding the T. thermophila
80 kDa protein subunit.

Figure 43 shows the amino acid sequence of the T. thermophila 80 kDa 
protein subunit. 

Figure 44 shows the nucleotide sequence encoding the T. thermophila
95 kDa protein subunit.

Figure 45 shows the amino acid sequence of the T. thermophila 95 kDa
protein subunit.

Figure 46 shows the amino acid sequence of L8543.12 ("Est2p"). 

Figure 47 shows the alignment of the amino acid sequence encoded by
the Oxytricha PCR product with the Euplotes p123 sequence.

Figure 48 shows the DNA sequence of Est2.

Figure 49 shows partial amino acid sequence from a cDNA clone 
encoding human telomerase peptide motifs. 

Figure 50 shows partial DNA sequence of a cDNA clone encoding
human telomerase peptide motifs.

Figure 51 shows the amino acid sequence of tez1, also called S. pombe
trt.

Figure 52 shows, in two pages, the DNA sequence of tez1. Intronic and
other non-coding regions are shown in lower case and exons (i.e., coding regions) are
shown in upper case.

Figure 53 shows the alignment of EST2p, Euplotes, and Tetrahymena 
sequences, as well as consensus sequence. 

Figure 54 shows the sequences of peptides useful for production of anti- 
hTRT antibodies. 

Figure 55 is a schematic summary of the tez1 sequencing experiments.

Figure 56 shows two degenerate primers used in PCR to identify the S.
pombe homolog of the E. aediculatus p123 sequences.

Figure 57 shows the four major bands produced in PCR using
degenerate primers to identify the S. pombe homolog of the E. aediculatus p123
sequences.

Figure 58 shows the alignment of the M2 PCR product with E.
aediculatus p123, S. cerevisiae, and Oxytricha telomerase protein sequences.

Figure 59 is a schematic showing the 3' RT PCR strategy for identifying
the S. pombe homolog of the E. aediculatus p123.

Figure 60 shows characteristics of the libraries used to screen for S. 
pombe telomerase protein sequences and shows the results of screening the libraries for 
S. pombe telomerase protein sequences. 

Figure 61 shows the positive results obtained with the HindIII-digested
positive genomic clones containing S. pombe telomerase sequence.

Figure 62 is a schematic showing the 5' RT PCR strategy used to obtain
a full length S. pombe TRT clone.

Figure 63 shows the alignment of RT domains from telomerase catalytic
subunits for S. pombe (S.p.), S. cerevisiae (S.c.) and E. aediculatus (E.a.).

Figure 64 shows the alignment of the sequences from Euplotes
("Ea_p123"), S. cerevisiae ("Sc_Est2p"), and S. pombe ("Sp_Tez1p"). In Panel A, the
shaded areas indicate residues shared between two sequences. In Panel B, the shaded
areas indicate residues shared between all three sequences.

Figure 65 shows the disruption strategy used with the telomerase genes
in S. pombe.

Figure 66 shows the experimental results confirming disruption of tez1.

Figure 67 shows the progressive shortening of telomeres in S. pombe
due to tez1 disruption.

Figure 68 shows, in four pages, the DNA and amino acid sequences of the ORF
encoding an approximately 63 kDa telomerase protein encoded by the EcoRI-NotI
insert of clone 712562.

Figure 69 shows an alignment of reverse transcriptase motifs from 
various sources. 

Figure 70 provides a restriction and function map of plasmid pGRN121. 

Figure 71 shows, in two pages, the results of preliminary nucleic acid 
sequencing analysis of a hTRT cDNA sequence. 

Figure 72 shows, in ten pages, the preliminary nucleic acid sequence of 
hTRT and deduced ORF sequences in three reading frames. 

Figure 73 provides a restriction and function map of plasmid pGRN121. 

Figure 74 shows, in eight pages, refined nucleic acid sequence and
deduced ORF sequences of hTRT.

Figure 75 shows a restriction map of lambda clone 25-1.1.

DETAILED DESCRIPTION OF THE INVENTION
I. INTRODUCTION 

Telomerase is a ribonucleoprotein complex (RNP) comprising an RNA
component and a catalytic protein component. The present invention relates to the
cloning and characterization of the catalytic protein component of telomerase,
hereinafter referred to as "TRT" (telomerase reverse transcriptase). TRT is so named
because this protein acts as an RNA-dependent DNA polymerase (reverse
transcriptase), using the telomerase RNA component (hereinafter, "TR") to direct
synthesis of telomere DNA repeat sequences. Moreover, TRT is evolutionarily related
to other reverse transcriptases (see Example 12).

In one aspect, the present invention relates to the cloning and
characterization of the catalytic protein component of human telomerase, hereinafter
referred to as "hTRT." Human TRT is of extraordinary interest and value because, as
noted supra, telomerase activity in human (and other mammalian) cells correlates with
cell proliferative capacity, cell immortality, and the development of a neoplastic
phenotype. For example, telomerase activity and, as demonstrated in Example 2, infra,
levels of human TRT gene products are elevated in immortal human cells (such as
malignant tumor cells and immortal cell lines) relative to mortal cells (such as most
human somatic cells).

The present invention further provides methods and compositions
valuable for diagnosis, prognosis, and treatment of human diseases and disease
conditions, as described in some detail infra. Also provided are methods and reagents
useful for immortalizing cells (in vivo and ex vivo), producing transgenic animals with
desirable characteristics, and numerous other uses, many of which are described infra.
The invention also provides methods and reagents useful for preparing, cloning, or
re-cloning TRT genes and proteins from ciliates, fungi, vertebrates, such as mammals, and
other organisms.

As described in detail infra, TRT was initially characterized following
purification of telomerase from the ciliate Euplotes aediculatus. Extensive purification
of E. aediculatus telomerase, using RNA-affinity chromatography and other methods,
yielded the protein "p123". Surprisingly, p123 is unrelated to proteins previously
Mol Biol 215:403). Searching this database with the Est2p sequence did not indicate a
match, but searching with p123 and trt1 sequences identified a human EST (Genbank
accession no. AA281296; see SEQUENCE ID NO: 8), as described in Example 1,
putatively encoding a homologous protein. Complete sequencing of the cDNA clone
containing the EST (hereinafter, "clone 712562"; see SEQUENCE ID NO: 3) showed
that seven RT motifs were present. However, this clone did not encode a contiguous
human TRT with all seven motifs, because motifs B', C, D, and E were contained in a
different open reading frame (ORF) than the more NH2-terminal motifs. In addition,
the distance between motifs A and B' was substantially shorter than that of the three
previously characterized TRTs. Clone 712562 was obtained from the I.M.A.G.E.
Consortium; Lennon et al., 1996, Genomics 33:151.

A cDNA clone, pGRN121, encoding a functional hTRT (see Figure 16,
SEQUENCE ID NO: 1) was isolated from a cDNA library derived from the human 293
cell line as described in Example 1. Comparing clone 712562 with pGRN121 showed
that clone 712562 has a 182 base pair deletion (see Figure 24, SEQUENCE ID NO: 9)
between motifs A and B'. The additional 182 base pairs present in pGRN121 place all
of the TRT motifs in a single open reading frame, and increase the spacing between the
motif A and motif B' regions to a distance consistent with the other known TRTs. As is
described infra in the Examples (e.g., Example 7), SEQUENCE ID NO: 1 encodes a
catalytically active telomerase protein having the sequence of SEQUENCE ID NO: 2.
The polypeptide of SEQUENCE ID NO: 2 has 1132 residues and a calculated
molecular weight of about 127 kilodaltons (kD).
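
The reading-frame effect of the 182 base pair segment follows from simple arithmetic: 182 is not a multiple of three, so its absence displaces the reading frame of everything downstream. The following sketch checks that, and also makes a rough mass estimate from the residue count. The average residue mass of about 110 Da is a common rule of thumb and is an assumption here, not a value taken from this description; the function names are illustrative only.

```python
CODON_LENGTH = 3
# Assumption: rule-of-thumb average mass per amino acid residue, in daltons.
AVG_RESIDUE_MASS_DA = 110.0


def frame_shift(deletion_bp):
    """Return the reading-frame displacement caused by a deletion (0 = in frame)."""
    return deletion_bp % CODON_LENGTH


def approx_mass_kd(n_residues):
    """Rough polypeptide mass in kilodaltons from the residue count alone."""
    return n_residues * AVG_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    print("frame displacement for a 182 bp deletion:", frame_shift(182))
    print("approximate mass of a 1132-residue polypeptide (kD):", approx_mass_kd(1132))
```

A nonzero displacement is consistent with motifs B', C, D and E of clone 712562 lying in a different ORF than the upstream motifs, and the rough mass estimate falls in the same range as the calculated value of about 127 kD.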

As is discussed infra, and described in Example 9, infra, TRT cDNAs
possessing the 182 basepair deletion characteristic of the clone 712562 are detected
following reverse transcription of RNA from telomerase-positive cells (e.g., testis and
293 cells). hTRT RNAs lacking this 182 base pair sequence are referred to generally as
"Δ182 variants" and may represent one, two, or several species. Although the hTRT
variants lacking the 182 basepair sequence found in the pGRN121 cDNA are unlikely
to encode a fully active telomerase catalytic enzyme, they may play a role in telomerase
regulation, as discussed infra, and/or have partial telomerase activity, such as telomere
binding or hTR binding activity, as discussed infra.

Thus, in one aspect, the present invention provides an isolated
polynucleotide with a sequence of a naturally occurring human TRT gene or mRNA
including, but not limited to, a polynucleotide having the sequence as set forth in Figure
16 (SEQUENCE ID NO: 1). In a related aspect, the invention provides a
polynucleotide encoding an hTRT protein, fragment, variant or derivative. In another
related aspect, the invention provides sense and antisense nucleic acids that bind to an
hTRT gene or mRNA. The invention further provides hTRT proteins, whether
synthesized or purified from natural sources, as well as antibodies and other agents that
specifically bind an hTRT protein or a fragment thereof. The present invention also
provides many novel methods, including methods that employ the aforementioned
compositions, for example, by providing diagnostic and prognostic assays for human
diseases, methods for developing therapeutics and methods of therapy, identification of
telomerase-associated proteins, and methods for screening for agents capable of
activating or inhibiting telomerase activity. Numerous other aspects and embodiments
of the invention are provided infra.

One aspect of the invention is the use of a polynucleotide that is at
least ten nucleotides to about 10 kb or more in length and comprises a contiguous
sequence of at least ten nucleotides that is identical or exactly complementary to a
contiguous sequence in a naturally occurring hTRT gene or hTRT mRNA in assaying
or screening for an hTRT gene sequence or hTRT mRNA, or in preparing a
recombinant host cell.
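
The criterion of a contiguous run of at least ten nucleotides that is identical or exactly complementary to a reference sequence can be checked mechanically. The sketch below is one possible way to do so; the function names and the short reference string are hypothetical and made up for illustration (the reference is not an hTRT sequence).

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Hypothetical reference used only to exercise the function; not an hTRT sequence.
REFERENCE = "ATGCCGTTAGCAGGCTTAACCGGATCCA"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def has_contiguous_match(oligo, reference, min_len=10):
    """True if some window of min_len bases in oligo is identical to, or exactly
    complementary to, a contiguous stretch of reference."""
    oligo = oligo.upper()
    targets = (reference.upper(), reverse_complement(reference))
    for start in range(len(oligo) - min_len + 1):
        window = oligo[start:start + min_len]
        if any(window in target for target in targets):
            return True
    return False


if __name__ == "__main__":
    sense_probe = REFERENCE[5:17]
    antisense_probe = reverse_complement(REFERENCE[3:15])
    print(has_contiguous_match(sense_probe, REFERENCE))
    print(has_contiguous_match(antisense_probe, REFERENCE))
    print(has_contiguous_match("TTTTTTTTTTTT", REFERENCE))
```

Oligonucleotides shorter than the minimum length never match, because no window of the required size exists.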

A further aspect of the invention is the use of an agent increasing
expression of hTRT in the manufacture of a medicament for the treatment of a
condition addressed by increasing proliferative capacity of a vertebrate cell,
optionally the medicament being for inhibiting the effects of aging.

Yet a further aspect of the invention is the use of an inhibitor of
telomerase activity in the manufacture of a medicament for the treatment of a
condition associated with an elevated level of telomerase activity within a human cell.

The proteins, variants and fragments of the invention, and the
encoding polynucleotides or fragments, are also each provided in a further aspect of
this invention for use as a pharmaceutical.

The invention further includes the use of a protein, variant or
fragment, or of a polynucleotide or fragment, in each case as defined herein, in the
manufacture of a medicament, for example in the manufacture of a medicament for
inhibiting an effect of aging or cancer.

Another aspect of the invention is a polynucleotide selected from:

(a) the DNA having a sequence as set forth in Figure 16; 

(b) a polynucleotide of at least 10 nucleotides which hybridizes to the 
foregoing DNA and which codes for an hTRT protein or variant or which hybridizes to 
a coding sequence for such a variant; and, 

(c) DNA sequences which are degenerate as a result of the genetic code
to the DNA sequences defined in (a) and (b) and which code for an hTRT polypeptide
or variant.

In certain embodiments of the present invention, the hTRT
polynucleotides are other than the 389 nucleotide polynucleotide of SEQUENCE ID
NO: 8 and/or other than clone 712562, the plasmid containing an insert, the sequence of
which insert is shown in Figure 18 (SEQUENCE ID NO: 3).

The description below is organized by topic. Part II further describes
amino acid motifs characteristic of TRT proteins, as well as TRT genes encoding
proteins having such motifs. Parts III-VI describe, inter alia, nucleic acids, proteins,
antibodies and purified compositions of the invention with particular focus on human
TRT related compositions. Part VII describes, inter alia, methods and compositions of
the invention useful for treatment of human disease. Part VIII describes production and
identification of immortalized human cell lines. Part IX describes, inter alia, uses of
the nucleic acids, polynucleotides, and other compositions of the invention for
diagnosis of human diseases. Part X describes, inter alia, methods and compositions of
the invention useful for screening and identifying agents and treatments that modulate
(e.g., inhibit or promote) telomerase activity or expression. Part XI describes, inter
alia, transgenic animals (e.g., telomerase knockout animals and cells). Part XII is a
glossary of terms used in Parts I-XI. Part XIII describes examples relating to specific
embodiments of the invention. The organization of the description of the invention by
topic and subtopic is to provide clarity, and not to be limiting in any way.

II. TRT GENES AND PROTEINS

The present invention provides isolated and/or recombinant genes and
proteins having a sequence of a telomerase catalytic subunit protein (i.e., telomerase
reverse transcriptase), including, but not limited to, the naturally occurring forms of
such genes and proteins in isolated or recombinant form. Typically, TRTs are large,
basic proteins having reverse transcriptase (RT) and telomerase-specific (T) amino
acid motifs, as disclosed herein. Because these motifs are conserved across diverse
organisms, TRT genes of numerous organisms may be obtained using the methods of
the invention or identified using primers, nucleic acid probes, and antibodies of the
invention, such as those specific for one or more of the motif sequences.

The seven RT motifs found in TRTs, while similar to those found in
other reverse transcriptases, have particular hallmarks. For example, as shown in
Figure 4, within the TRT RT motifs there are a number of amino acid substitutions
(marked with arrows) in residues highly conserved among the other RTs. For example,
in motif C the two aspartic acid residues (DD) that coordinate active site metal ions
(see, Kohlstaedt et al., 1992, Science 256:1783; Jacobo-Molina et al., 1993, Proc.
Natl. Acad. Sci. U.S.A. 90:6320; Patel et al., 1995, Biochemistry 34:5351) occur in the
context hxDD(F/Y) in the telomerase RTs compared to (F/Y)xDDh in the other RTs
(where "h" is a hydrophobic amino acid, and "x" is any amino acid; see Xiong et al.,
1990, EMBO J. 9:3353; Eickbush, in The Evolutionary Biology of Viruses, (S. Morse,
Ed., Raven Press, NY, p. 121, 1994)). Another systematic change characteristic of the
telomerase subgroup occurs in motif E, where WxGxSx is a consensus sequence or is
conserved among the telomerase proteins, whereas hLGxxh is characteristic of other
RTs (Xiong et al., supra; Eickbush supra). This motif E is called the "primer grip", and
mutations in this region have been reported to affect RNA priming but not DNA
priming (Powell et al., 1997, J. Biol. Chem. 272:13262). Because telomerase requires
a DNA primer (e.g., the chromosome 3' end), it is not unexpected that telomerase
should differ from other RTs in the primer grip region. In addition, the distance
between motifs A and B' is longer in the TRTs than is typical for other RTs, which may
represent an insertion within the "fingers" region of the structure which resembles a
right hand (Figure 3; see Kohlstaedt et al., supra; Jacobo-Molina et al., supra; and
Patel et al., supra).



Moreover, as noted supra, Motif T is an additional hallmark of TRT
proteins. This Motif T, as shown, for example in Figure 4 (W-L-X-Y-X-X-h-h-X-h-h-
X-p-F-F-Y-X-T-E-X-p-X-X-X-p-X-X-X-Y-X-R-K-X-X-W [X is any amino acid, h is
hydrophobic, p is polar]), comprises a sequence that can be described using the
formula:

Trp-R1-X7-R1-R1-R2-X-Phe-Phe-Tyr-X-Thr-Glu-X8-9-R3-R3-Arg-R4-X2-Trp

where X is any amino acid and the subscript refers to the number of consecutive
residues, R1 is leucine or isoleucine, R2 is glutamine or arginine, R3 is phenylalanine or
tyrosine, and R4 is lysine or histidine.

The T motif can also be described using the formula:

Trp-R1-X4-h-h-X-h-h-R2-p-Phe-Phe-Tyr-X-Thr-Glu-X-p-X3-p-X2-3-R3-R3-Arg-R4-X2-Trp

where X is any amino acid and a subscript refers to the number of consecutive residues,
R1 is leucine or isoleucine, R2 is glutamine or arginine, R3 is phenylalanine or tyrosine,
R4 is lysine or histidine, h is a hydrophobic amino acid selected from Ala, Leu, Ile, Val,
Pro, Phe, Trp, and Met, and p is a polar amino acid selected from Gly, Ser, Thr, Tyr,
Cys, Asn and Gln.

In one embodiment, the present invention provides isolated naturally
occurring and recombinant TRT proteins comprising one or more of the motifs
illustrated in Figure 11, e.g.,

Motif T     W-X12-FFY-X-TE-X10-11-R-X3-W-X7-I
Motif T'    E-X2-V-X
Motif 1     X3-R-X2-P-K-X3, or, alternatively, h-R-h-X-P-K
Motif 2     X-R-X-I-X or, alternatively, (F/L)-R-h-I-X2-h
Motif A     X4-F-X3-D-X4-YD-X2 or, alternatively, P-X-L-Y-F-h-X-h-D-h-X2-C-Y-D-X-I
Motif B'    Y-X4-G-X2-QG-X3-S-X8 or, alternatively, K-X-Y-X-Q-X2-G-I-P-Q-G-S-X-L-S-X-h-L
Motif C     X6-DD-X-L-X3 or, alternatively, L-L-R-L-X-D-D-X-L-h-I-T

When the TRT protein shown contains more than one TRT motif, the order
(NH2->COOH) is as shown in Figure 11.
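
The first T motif formula (Trp-R1-X7-R1-R1-R2-X-Phe-Phe-Tyr-X-Thr-Glu-X8-9-R3-R3-Arg-R4-X2-Trp) translates directly into a regular expression, which gives one way to scan a protein sequence for a candidate T motif. The sketch below is only an illustration under that reading of the formula; the function name is hypothetical, and the test peptide was chosen to satisfy the formula and is not asserted to be the sequence of any particular organism.

```python
import re

# One-letter rendering of the first T motif formula.
T_MOTIF = re.compile(
    r"W[LI]"      # Trp-R1 (R1 = Leu or Ile)
    r".{7}"       # X7
    r"[LI][LI]"   # R1-R1
    r"[QR]"       # R2 (Gln or Arg)
    r"."          # X
    r"FFY.TE"     # Phe-Phe-Tyr-X-Thr-Glu
    r".{8,9}"     # X8-9
    r"[FY][FY]"   # R3-R3 (Phe or Tyr)
    r"R[KH]"      # Arg-R4 (R4 = Lys or His)
    r".{2}W"      # X2-Trp
)

# Illustrative peptide chosen to satisfy the formula (assumption, for testing only).
EXAMPLE_PEPTIDE = "WLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVW"


def find_t_motif(protein):
    """Return the first substring matching the T motif formula, or None."""
    match = T_MOTIF.search(protein.upper())
    return match.group(0) if match else None


if __name__ == "__main__":
    print(find_t_motif("MA" + EXAMPLE_PEPTIDE + "GG"))
    print(find_t_motif("MAWLAAAGG"))
```

A match only indicates that a sequence fits the formula; it does not by itself establish that a protein is a TRT.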

In one embodiment, the present invention provides isolated naturally
occurring TRT proteins comprising the following supermotif:



(NH 2 )-X 300 -6oo-W-X 12 ^^ 

pk-:wr-x-i-x^ 



It will be apparent to one of skill that, provided with the reagents,
including the TRT sequences disclosed herein for those reagents and the methods and
guidance provided herein (including specific methodologies described infra), TRT
genes and proteins can be obtained, isolated and produced in recombinant form by one
of ordinary skill. For example, primers (e.g., degenerate amplification primers) are
provided that hybridize to gene sequences encoding RT and T motifs characteristic of
TRT. For example, one or more primers or degenerate primers that hybridize to
sequences encoding the FFYXTE region of the T motif, other TRT motifs (as discussed
infra), or combinations of motifs or consensus sequences, can be prepared based on the
codon usage of the target organism, and used to amplify the TRT gene sequence from
genomic DNA or cDNA prepared from the target organism. Use of degenerate primers
is well known in the art and entails use of sets of primers that hybridize to the set of
nucleic acid sequences that can potentially encode the amino acids of the target motif,
taking into account codon preferences and usage of the target organism, and by using
amplification (e.g., PCR) conditions appropriate for allowing base mismatches in the
annealing steps of PCR. Typically two primer sets are used; however, single primer
(or, in this case, a single degenerate primer set) amplification systems are well known
and may be used to obtain TRT genes.

Table 1 provides illustrative primers of the invention that may be used to
amplify novel TRT nucleic acids, particularly those from vertebrates (e.g., humans and
other mammals). "N" is an equimolar mixture of all four nucleotides, and nucleotides
within parentheses are equimolar mixtures of the specified nucleotides.



TABLE 1

ILLUSTRATIVE DEGENERATE PRIMERS FOR AMPLIFICATION
OF TRT NUCLEIC ACIDS

    motif      direction   5'- sequence -3'
a   FFYVTE     Forward     TT(CT)TT(CT)TA(CT)GTNACNGA
b   FFYVTE     Reverse     TCNGTNAC(GA)TA(GA)AA(GA)AA
c   RFIPKP     Forward     (CA)GNTT(CT)AT(ACT)CCNAA(AG)CC
d   RFIPKP     Reverse     GG(TC)TTNGG(TGA)AT(GA)AANC
e   AYDTI      Forward     GCNTA(CT)GA(CT)ACNAT
f   AYDTI      Reverse     TANGT(GA)TC(GA)TANGC
g   GIPQG      Forward     GGNAT(ACT)CCNCA(AG)GG
h   GIPQGS     Reverse     (GC)(AT)NCC(TC)TGNGG(TGA)ATNCC
i   LVDDFL     Forward     (CT)TNGTNGA(CT)GA(CT)TT(CT)(CT)T
j   DDFLLVT    Reverse     GTNACNA(GA)NA(GA)(GA)AA(GA)TC(GA)TC
Preferred primer combinations (y = yes, n = no)

                Reverse
Forward    b    d    f    h    j
a          n    y    y    y    y
c          n    n    y    y    y
e          n    n    n    y    y
g          n    n    n    n    y
i          n    n    n    n    n
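
The primer notation of Table 1 ("N" for all four nucleotides, parentheses for a mixture of the listed nucleotides) can be parsed mechanically to count how many distinct oligonucleotides a degenerate primer represents and to derive the reverse-complement primer. The sketch below is a minimal illustration of that notation only; the function names are hypothetical, and nothing here models annealing conditions or codon preferences.

```python
from itertools import product

_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def parse_primer(primer):
    """Turn Table 1 notation into a list of base sets, one set per position."""
    positions = []
    i = 0
    while i < len(primer):
        ch = primer[i]
        if ch == "(":
            close = primer.index(")", i)          # mixture such as (CT)
            positions.append(frozenset(primer[i + 1:close]))
            i = close + 1
        elif ch == "N":                           # any of the four nucleotides
            positions.append(frozenset("ACGT"))
            i += 1
        else:                                     # single fixed nucleotide
            positions.append(frozenset(ch))
            i += 1
    return positions


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for bases in parse_primer(primer):
        count *= len(bases)
    return count


def reverse_complement(positions):
    """Reverse-complement a parsed primer, position by position."""
    return [frozenset(_COMPLEMENT[b] for b in bases) for bases in reversed(positions)]


def expand(primer):
    """List every concrete sequence in the degenerate set."""
    pools = [sorted(bases) for bases in parse_primer(primer)]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    primer_a = "TT(CT)TT(CT)TA(CT)GTNACNGA"   # Table 1, primer a
    primer_b = "TCNGTNAC(GA)TA(GA)AA(GA)AA"   # Table 1, primer b
    print(degeneracy(primer_a))
    print(reverse_complement(parse_primer(primer_a)) == parse_primer(primer_b))
```

Applied to primers a and b, the comparison shows that the reverse primer for the FFYVTE motif is the position-by-position reverse complement of the forward primer.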



In one embodiment, an amplified TRT nucleic acid is used as a hybridization probe for
colony hybridization to a library (e.g., cDNA library) made from the target organism,
such that a nucleic acid having the entire TRT protein coding sequence, or a substantial
portion thereof, is identified and isolated or cloned. Reagents and methods such as
those just described were used in accordance with the methods described herein to
obtain TRT gene sequences of Oxytricha trifallax and Tetrahymena thermophila, as
described in detail infra. It will be recognized that following cloning of a previously
uncharacterized TRT gene, the sequence can be determined by routine methods and the
encoded polypeptide synthesized and assayed for a TRT activity, such as telomerase
catalytic activity (as described herein and/or by telomerase assays known in the art).

It will also be apparent to those of skill that TRT genes may be cloned
using any of a variety of cloning methods of the invention because the TRT motif
sequences and the nucleic acids of the invention comprising such sequences can be
used in a wide variety of such methods. For example, hybridization using a probe
based on the sequence of a known TRT to DNA or other nucleic acid libraries from the
target organism, as described in Example 1, can be used. It will be appreciated that
degenerate PCR primers or their amplification products, such as those described supra,
may themselves be labeled and used as hybridization probes. In another embodiment,
expression cloning methods are used. For example, one or more antibodies that
specifically bind peptides that span a TRT motif or other TRT epitope, such as the
FFYXTE motif, can be employed to isolate a ribosomal complex comprising a TRT
protein and the mRNA that encodes it. For generating such antibodies of the invention,
the peptide immunogens are typically between 6 and 30 amino acids in length, more
often about 10 to 20 amino acids in length. The antibodies may also be used to probe a
cDNA expression library derived from the organism of interest to identify a clone
encoding a TRT sequence. In another embodiment, computer searches of DNA
databases for DNAs containing sequences conserved with known TRTs can also be
used to identify a clone comprising TRT sequence.

In one aspect, the present invention provides compositions comprising
an isolated or recombinant polypeptide having the amino acid sequence of a naturally
occurring TRT protein. Usually the naturally occurring TRT has a molecular weight of
between about 80,000 daltons (D) and about 150,000 D, most often between about
95,000 D and about 130,000 D. Typically, the naturally occurring TRT has a net
positive charge at pH 7 (calculated pI typically greater than 9). In one embodiment, the
polypeptide exhibits a telomerase activity as defined herein. In a related embodiment,
the polypeptide has a TRT-specific region (T motif) sequence and exhibits a telomerase
activity. The invention further provides fragments of such polypeptides. The present
invention also provides isolated or recombinant polynucleotides having the sequence of
a naturally occurring gene encoding a TRT protein. The invention provides reagents
useful for isolating sequence of a TRT from nonvertebrates (such as a yeast) and
vertebrates, such as mammals (e.g., murine or human). The isolated polynucleotide
may be associated with other naturally occurring or recombinant or synthetic vector
nucleic acid sequences. Typically, the isolated nucleic acid is smaller than about 300
kb, often less than about 50 kb, more often less than about 20 kb, frequently less than
about 10 kb and sometimes less than about 5 kb or 2 kb in length. In some
embodiments the isolated TRT polynucleotide is even smaller, such as a gene fragment,
primer, or probe of less than about 1 kb or less than 0.1 kb.
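
The statement that a TRT typically carries a net positive charge at pH 7 (calculated pI greater than 9) can be explored with a simple Henderson-Hasselbalch model. The sketch below is an approximation only: the pKa values are textbook-style figures chosen as assumptions (published scales differ), the function names are hypothetical, and the short peptides in the usage lines are made up rather than taken from any TRT.

```python
# Assumed approximate pKa values; different published scales give different numbers.
POSITIVE_PKA = {"K": 10.5, "R": 12.5, "H": 6.0}
NEGATIVE_PKA = {"D": 3.9, "E": 4.1, "C": 8.3, "Y": 10.1}
N_TERMINUS_PKA = 9.0
C_TERMINUS_PKA = 2.0


def net_charge(sequence, ph=7.0):
    """Estimate the net charge of a polypeptide at the given pH."""
    charge = 1.0 / (1.0 + 10 ** (ph - N_TERMINUS_PKA))      # free amino terminus
    charge -= 1.0 / (1.0 + 10 ** (C_TERMINUS_PKA - ph))     # free carboxyl terminus
    for residue in sequence.upper():
        if residue in POSITIVE_PKA:
            charge += 1.0 / (1.0 + 10 ** (ph - POSITIVE_PKA[residue]))
        elif residue in NEGATIVE_PKA:
            charge -= 1.0 / (1.0 + 10 ** (NEGATIVE_PKA[residue] - ph))
    return charge


def isoelectric_point(sequence, tolerance=0.001):
    """Find the pH of zero net charge by bisection (charge falls as pH rises)."""
    low, high = 0.0, 14.0
    while high - low > tolerance:
        mid = (low + high) / 2.0
        if net_charge(sequence, mid) > 0:
            low = mid
        else:
            high = mid
    return (low + high) / 2.0


if __name__ == "__main__":
    basic_peptide = "KKKKRR"    # made-up basic peptide
    acidic_peptide = "DDDDEE"   # made-up acidic peptide
    print(net_charge(basic_peptide), isoelectric_point(basic_peptide))
    print(net_charge(acidic_peptide), isoelectric_point(acidic_peptide))
```

A full-length protein sequence can be passed in the same way; the result depends on the pKa set assumed.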



III. NUCLEIC ACIDS
A) GENERALLY

The present invention provides isolated and recombinant nucleic acids
having a sequence of a polynucleotide encoding a telomerase catalytic subunit protein
(TRT), such as a recombinant TRT gene from Euplotes, Tetrahymena, S. pombe or
humans. Exemplary polynucleotides are provided in Figure 13 (Euplotes); Figure 15
(S. pombe) and Figure 16 (human, GenBank Accession No. AF015950). The present
invention provides sense and anti-sense polynucleotides having a TRT gene sequence,
including probes, primers, TRT-protein-encoding polynucleotides, and the like.



B) HUMAN TRT 

The present invention provides nucleic acids having a sequence of a
telomerase catalytic subunit from humans (i.e., hTRT).

In one aspect, the invention provides a polynucleotide having a sequence
or subsequence of a human TRT gene or RNA. In one embodiment, the polynucleotide
of the invention has a sequence of SEQUENCE ID NO: 1 shown in Figure 16 or a
subsequence thereof. In another embodiment, the polynucleotide has a sequence of
SEQUENCE ID NO: 3 (Figure 18), SEQUENCE ID NO: 4 (Figure 20), or
subsequences thereof. The invention also provides polynucleotides with substantial
sequence identity to the hTRT nucleic acid sequences disclosed herein (e.g., including
but not limited to SEQUENCE ID NOS: 1 [Figure 16], 4 [Figure 20], 6 [Figure 21],
and 7 [Figure 12]). Thus, the invention provides naturally occurring alleles of human
TRT genes and variant polynucleotide sequences having one or more nucleotide
deletions, insertions or substitutions relative to an hTRT nucleic acid sequence
disclosed herein. As described infra, variant nucleic acids may be produced using the
recombinant or synthetic methods described below or by other means.

The invention also provides isolated and recombinant polynucleotides
having a sequence from a flanking region of a human TRT gene. Such polynucleotides
include those derived from genomic sequences of untranslated regions of the hTRT
mRNA. An exemplary genomic sequence is shown in Figure 21 (SEQUENCE ID NO:
6). As described in Example 4, SEQUENCE ID NO: 6 was obtained by sequencing a
clone, λGΦ5, isolated from a human genomic library. Lambda GΦ5 contains a 15
kilobasepair (kbp) insert including approximately 13,000 bases 5' to the hTRT coding
sequences. This clone contains hTRT promoter sequences and other hTRT gene
regulatory sequences (e.g., enhancers).

The invention also provides isolated and recombinant polynucleotides 
having a sequence from an intronic region of a human TRT gene. An exemplary 
intronic sequence is shown in Figure 12 (SEQUENCE ID NO: 7; see Example 3). In 
some embodiments, hTRT introns are included in "minigenes" for improved expression 
of hTRT proteins in eukaryotic cells. 

In a related aspect, the present invention provides polynucleotides that 
encode hTRT proteins or protein fragments, including modified, altered and variant 
hTRT polypeptides. In one embodiment, the encoded hTRT protein or fragment has an 
amino acid sequence as set forth in Figure 17 (SEQUENCE ID NO: 2), or with 
conservative substitutions of SEQUENCE ID NO: 2. In one embodiment, the encoded 
hTRT protein or fragment has substitutions that change an activity of the protein (e.g., 
telomerase catalytic activity). 

It will be appreciated that, as a result of the degeneracy of the genetic
code, the nucleic acid encoding the hTRT protein need not have the sequence of a
naturally occurring hTRT gene, but that a multitude of polynucleotides can encode an
hTRT polypeptide having an amino acid sequence of SEQUENCE ID NO: 2. The
present invention provides each and every possible variation of nucleotide sequence
that could be made by selecting combinations based on possible codon choices made in
accordance with known triplet genetic codes, and all such variations are specifically
disclosed hereby. Thus, although in some cases hTRT polypeptide-encoding nucleotide
sequences that are capable of hybridizing to the nucleotide sequence of the naturally
occurring sequence (under appropriately selected conditions of stringency) are
preferred, it may be advantageous in other cases to produce nucleotide sequences
encoding hTRT that employ a substantially different codon usage and so perhaps do not
hybridize to nucleic acids with the naturally occurring sequence.
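
The size of the "multitude" of coding sequences can be made concrete: under the standard genetic code, the number of distinct nucleotide sequences encoding a given peptide is the product of the number of codons available for each residue. The sketch below assumes the standard code (61 sense codons) and uses a hypothetical function name; the short peptide in the usage line is the FFYVTE motif from Table 1.

```python
# Number of codons per amino acid in the standard genetic code (assumed here).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def count_encodings(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    total = 1
    for residue in peptide.upper():
        total *= CODONS_PER_RESIDUE[residue]
    return total


if __name__ == "__main__":
    print(count_encodings("FFYVTE"))
```

Because the count multiplies at every residue, it grows very rapidly with length, which is why a full-length polypeptide can be encoded by far more sequences than could ever be listed individually.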

In particular embodiments, the invention provides hTRT oligo- and 
polynucleotides that comprise a subsequence of an hTRT nucleic acid disclosed herein 
(e.g., SEQUENCE ID NOS: 1 and 6). The nucleic acids of the invention typically 
comprise at least about 10, more often at least about 12 or about 15 consecutive bases 
of the exemplified hTRT polynucleotide. Often, the nucleic acid of the invention will 
comprise a longer sequence, such as at least about 25, about 50, about 100, about 200, 
or at least about 500 to 3000 bases in length, for example when expression of a 
polypeptide, or full length hTRT protein is intended. 

In still other embodiments, the present invention provides "Δ182 hTRT"
polynucleotides having a sequence identical or complementary to naturally occurring or
non-naturally occurring hTRT polynucleotides such as SEQUENCE ID NO: 3 or
SEQUENCE ID NO: 4, which do not contain the 182 nucleotide sequence
(SEQUENCE ID NO: 9 [Figure 24]) found in pGRN121 (and also absent in clone 
712562). These polynucleotides are of interest, in part, because they encode 
polypeptides that contain different combinations or arrangements of TRT motifs than 
found in the "full-length" hTRT polypeptide (SEQUENCE ID NO: 2) such as is 
encoded by pGRN121. As discussed infra, it is contemplated that these polypeptides 
may play a biological role in nature (e.g., in regulation of telomerase expression in 
cells) and/or find use as therapeutics (e.g., as dominant-negative products that inhibit 
function of wild-type proteins), or have other roles and uses, e.g., as described herein.
For example, in contrast to the polypeptide encoded by pGRN121, clone
712562 encodes a 259 residue protein with a calculated molecular weight of
approximately 30 kD (hereinafter, "712562 hTRT"). The 712562 hTRT polypeptide
(SEQUENCE ID NO: 10 [Figure 19]) contains motifs T, 1, 2, and A, but not motifs B',
C, D and E (See Figure 4). Similarly, a variant hTRT polypeptide with therapeutic and
other activities may be expressed from a nucleic acid similar to the pGRN121 cDNA
but lacking the 182 basepairs missing in clone 712562, e.g., having the sequence shown
in Figure 20 (SEQUENCE ID NO: 4). This nucleic acid (hereinafter, "pro90 hTRT"),
which may be synthesized using routine synthetic or recombinant methods as described
herein, encodes a protein of 807 residues (calculated molecular weight of
approximately 90 kD) that shares the same amino terminal sequence as the hTRT
protein encoded by SEQUENCE ID NO: 1, but diverges at the carboxy-terminal region
(the first 763 residues are common, the last 44 residues of pro90 hTRT are different
than "full-length" hTRT). The pro90 hTRT polypeptide contains motifs T, 1, 2, and A,
but not motifs B', C, D, and E, and thus may have some, but likely not all, telomerase
activities.
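
The residue counts quoted above can be checked with simple arithmetic. The sketch below is a back-of-the-envelope check only: the average residue mass of about 0.110 kD is a common rule of thumb and an assumption here, not a value from this disclosure, and the observation that a 182-nucleotide deletion is not a multiple of three is offered as one reading consistent with the carboxy-terminal divergence described, not as a statement made by the text.

```python
# Figures quoted in the text.
DELETION_NT = 182
PRO90_COMMON, PRO90_DIVERGENT, PRO90_TOTAL = 763, 44, 807
CLONE_712562_RESIDUES = 259

# Assumption: rule-of-thumb average mass per amino acid residue, in kD.
AVG_RESIDUE_KD = 0.110


def shifts_reading_frame(deleted_nt):
    """A deletion shifts the downstream reading frame unless it is a multiple of 3."""
    return deleted_nt % 3 != 0


def approx_mass_kd(residues):
    """Rough molecular weight estimate from residue count."""
    return residues * AVG_RESIDUE_KD


print(shifts_reading_frame(DELETION_NT))
print(PRO90_COMMON + PRO90_DIVERGENT == PRO90_TOTAL)
print(round(approx_mass_kd(PRO90_TOTAL)), round(approx_mass_kd(CLONE_712562_RESIDUES)))
```

Under these assumptions the estimates land close to the "approximately 90 kD" and "approximately 30 kD" figures given above.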

C) PRODUCTION OF HUMAN TRT NUCLEIC ACIDS 

The polynucleotides of the invention have numerous uses including, but
not limited to, expression of hTRT polypeptides or fragments thereof, use as
sense or antisense probes or primers for hybridization and/or amplification of naturally
occurring hTRT genes or RNAs (e.g., for diagnostic or prognostic applications), and as
therapeutic agents (e.g., in antisense, triplex, or ribozyme compositions). As will be
apparent upon review of the disclosure, these uses will have enormous impact on the
diagnosis and treatment of human diseases relating to aging, cancer, and fertility as well
as the growth, reproduction, and manufacture of cell-based products. As described in
the following sections, the hTRT nucleic acids of the invention may be made (e.g.,
cloned, synthesized, or amplified) using techniques well known in the art.

1) CLONING, AMPLIFICATION, AND RECOMBINANT PRODUCTION

In one embodiment, hTRT genes or cDNAs are cloned using a nucleic
acid probe that specifically hybridizes to an hTRT mRNA, cDNA, or genomic DNA.
One suitable probe for this purpose is a polynucleotide having all or part of the
sequence provided in Figure 16 (SEQUENCE ID NO: 1), such as a probe comprising a
subsequence thereof. Typically, the target hTRT genomic DNA or cDNA is ligated
into a vector (e.g., a plasmid, phage, virus, yeast artificial chromosome, or the like) and
may be isolated from a genomic or cDNA library (e.g., a human placental cDNA
library). Once an hTRT nucleic acid is identified, it can be isolated according to
standard methods known to those of skill in the art. An illustrative example of
screening a human cDNA library for the hTRT gene is provided in Example 1;
similarly, an example of screening a human genomic library is found in Examples 3 and
4. Cloning methods are well known and are described, for example, in Sambrook et al.,
(1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold
Spring Harbor Laboratory (hereinafter, "Sambrook"); Berger and Kimmel, (1987)
Methods in Enzymology, Vol. 152: Guide to Molecular Cloning Techniques,
San Diego: Academic Press, Inc.; Ausubel et al., Current Protocols in Molecular
Biology, Greene Publishing and Wiley-Interscience, New York (1997); Cashion et al.,
U.S. Patent No. 5,017,478; and Carr, European Patent No. 0,246,864.

The invention also provides hTRT genomic or cDNA nucleic acids
isolated by amplification methods such as the polymerase chain reaction (PCR). In one
embodiment, hTRT protein coding sequence is amplified from an RNA or cDNA
sample (e.g., double stranded placental cDNA (Clontech, Palo Alto CA)) using the
primers 5'-GTGAAGGCACTGTTCAGCG-3' ("TCP1.1") and
5'-CGCGTGGGTGAGGTGAGGTG-3' ("TCP1.15"). In some embodiments a third
primer or second pair of primers may be used, e.g., for "nested PCR", to increase
specificity. One example of a second pair of primers is
5'-CTGTGCTGGGCCTGGACGATA-3' ("billTCP6") and
5'-AGCTTGTTCTCCATGTCGCCGTAG-3' ("TCP1.14"). It will be apparent to those of
skill that numerous other primers and primer combinations, useful for amplification of
hTRT nucleic acids, are provided by the present invention.

Moreover, the invention provides primers that amplify any specific
region (e.g., coding regions, promoter regions, and/or introns) or subsequence of hTRT
genomic DNA, cDNA or RNA. For example, the hTRT intron at position 274/275 of
SEQUENCE ID NO: 1 (see Example 3) may be amplified (e.g., for detection of
genomic clones) using primers TCP1.57 and TCP1.52 (primer pair 1) or primers
TCP1.49 and TCP1.50 (primer pair 2). (Primer names refer to primers listed in Table
2, infra.) The primer pairs can be used individually or in a nested PCR where primer
set 1 is used first. Another illustrative example relates to primers that specifically
amplify and so detect the 5' end of the hTRT mRNA or the exon encoding the 5' end of
the hTRT gene (e.g., to assess the size or completeness of a cDNA clone). The following
primer pairs are useful for amplifying the 5' end of hTRT: primers K320 and K321
(primer pair 3); primers K320 and TCP1.61 (primer pair 4); primers K320 and K322
(primer pair 5). The primer sets can be used in a nested PCR in the order set 5, then set
4 or set 3, or set 4 or set 5, then set 3. Yet another illustrative example involves primers
chosen to amplify or detect specifically the conserved hTRT TRT motif region
comprising approximately the middle third of the mRNA (e.g., for use as a
hybridization probe to identify TRT clones from, for example, nonhuman organisms).
The following primer pairs are useful for amplifying the TRT motif region of hTRT
nucleic acids: primers K304 and TCP1.8 (primer pair 6), or primers LT1 and TCP1.15
(primer pair 7). The primer sets can be used in a nested PCR experiment in the order
set 6 then set 7.
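
As a quick sanity check on the four primers whose sequences are spelled out above (TCP1.1, TCP1.15, billTCP6, and TCP1.14), the following sketch computes length, GC fraction, reverse complement, and a rough melting temperature. The sequences are copied from the text; the Wallace rule (2 °C per A/T, 4 °C per G/C) is a coarse approximation for short oligonucleotides and is an assumption of this sketch, not a method described in this disclosure.

```python
# Primer sequences as given in the text, written 5' to 3'.
PRIMERS = {
    "TCP1.1": "GTGAAGGCACTGTTCAGCG",
    "TCP1.15": "CGCGTGGGTGAGGTGAGGTG",
    "billTCP6": "CTGTGCTGGGCCTGGACGATA",
    "TCP1.14": "AGCTTGTTCTCCATGTCGCCGTAG",
}

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (5' to 3')."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rough Tm estimate in degrees C: 2 per A/T plus 4 per G/C."""
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


for name, seq in PRIMERS.items():
    print(name, len(seq), round(gc_fraction(seq), 2), wallace_tm(seq))
```

In a nested PCR, the inner pair (here billTCP6 and TCP1.14) would be expected to anneal within the product of the outer pair; checking that against SEQUENCE ID NO: 1 is outside the scope of this sketch.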

Suitable PCR amplification conditions are known to those of skill and
include (but are not limited to) 1 unit Taq polymerase (Perkin Elmer, Norwalk CT), 100
µM each dNTP (dATP, dCTP, dGTP, dTTP), 1x PCR buffer (50 mM KCl, 10 mM
Tris, pH 8.3 at room temperature, 1.5 mM MgCl2, 0.01% gelatin) and 0.5 µM primers,
with the amplification run for about 30 cycles at 94° for 45 sec, 55° for 45 sec and 72°
for 90 sec. It will be recognized by those of skill in the art that other thermostable
DNA polymerases, reaction conditions, and cycling parameters will also provide
suitable amplification. Other suitable in vitro amplification methods that can be used to
obtain hTRT nucleic acids include, but are not limited to, those described infra. Once
amplified, the hTRT nucleic acids can be cloned, if desired, into any of a variety of
vectors using routine molecular biological methods or detected or otherwise utilized in
accordance with the methods of the invention.
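
The cycling parameters and final concentrations above translate into a simple calculation when setting up a reaction. In the sketch below, the 30 cycles and the per-step hold times come from the text; the 50 µL reaction volume and the stock concentrations (10 mM dNTP, 10 µM primer) are hypothetical values chosen for illustration.

```python
# From the text: about 30 cycles of 94 deg for 45 s, 55 deg for 45 s, 72 deg for 90 s.
CYCLES = 30
STEP_SECONDS = {"denature_94": 45, "anneal_55": 45, "extend_72": 90}


def total_hold_seconds(cycles, steps):
    """Total programmed hold time, ignoring ramping between temperatures."""
    return cycles * sum(steps.values())


def stock_volume_ul(final_conc, stock_conc, reaction_ul):
    """Volume of stock needed (C1*V1 = C2*V2); concentrations in the same unit."""
    return final_conc * reaction_ul / stock_conc


# Hypothetical setup: 50 uL reaction, 10 mM (10000 uM) dNTP stock, 10 uM primer stock.
REACTION_UL = 50
dntp_ul = stock_volume_ul(100, 10000, REACTION_UL)   # 100 uM each dNTP, per the text
primer_ul = stock_volume_ul(0.5, 10, REACTION_UL)    # 0.5 uM primer, per the text

print(total_hold_seconds(CYCLES, STEP_SECONDS) / 60, dntp_ul, primer_ul)
```

The total excludes any initial denaturation or final extension step, which the text does not specify.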

One of skill will appreciate that the cloned or amplified hTRT nucleic
acids obtained as described above can be prepared or propagated using other methods,
such as chemical synthesis or replication by transformation into bacterial systems, such
as E. coli (see, e.g., Ausubel et al., supra), or eukaryotic, such as mammalian,
expression systems. Similarly, hTRT RNA can be expressed in accordance with the
present in vitro methods, or in bacterial systems such as E. coli using, for example,
commercially available vectors containing promoters recognized by an RNA
polymerase such as T7, T3 or SP6, or transcription of DNA generated by PCR
amplification using primers containing an RNA polymerase promoter.

The present invention further provides altered or modified hTRT nucleic
acids. It will be recognized by one of skill that the cloned or amplified hTRT nucleic
acids obtained can be modified (e.g., truncated, derivatized, altered) by methods well
known in the art (e.g., site-directed mutagenesis, linker scanning mutagenesis) or
simply synthesized de novo as described below. The altered or modified hTRT nucleic
acids are useful for a variety of applications, including, but not limited to, facilitating
cloning or manipulation of an hTRT gene or gene product, or expressing a variant
hTRT gene product. For example, in one embodiment, the hTRT gene sequence is
altered such that it encodes an hTRT polypeptide with altered properties or activities, as
discussed in detail infra, for example, by mutation in a conserved motif of hTRT. In
another illustrative example, the mutations in the protein coding region of an hTRT
nucleic acid may be introduced to alter glycosylation patterns, to change codon
preference, to produce splice variants, remove protease-sensitive sites, create antigenic
domains, modify specific activity, and the like. In other embodiments, the nucleotide
sequence encoding hTRT and its derivatives is changed without altering the encoded
amino acid sequences, for example, for the production of RNA transcripts having more
desirable properties, such as increased translation efficiency or a greater or a shorter
half-life, compared to transcripts produced from the naturally occurring sequence. In
yet another embodiment, altered codons are selected to increase the rate at which
expression of the peptide occurs in a particular prokaryotic or eukaryotic expression
host in accordance with the frequency with which particular codons are utilized by the
host. Useful in vitro and in vivo recombinant techniques that can be used to prepare
variant hTRT polynucleotides of the invention are found in Sambrook et al. and
Ausubel et al., both supra.
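
Selecting codons according to host usage, as described above, can be sketched as a synonymous recoding step. The usage frequencies below are hypothetical placeholder numbers for an unnamed host, and the input sequence is an arbitrary toy example; only the standard genetic code is assumed to be factual.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Hypothetical relative codon frequencies for an illustrative host (not real data).
HYPOTHETICAL_USAGE = {"CTG": 0.40, "TTA": 0.07, "AAG": 0.57, "AAA": 0.43}


def preferred_codon(aa, usage):
    """Pick the synonymous codon with the highest listed frequency (0 if unlisted)."""
    candidates = [c for c, residue in CODON_TABLE.items() if residue == aa]
    return max(candidates, key=lambda c: usage.get(c, 0.0))


def translate(dna):
    """Translate an in-frame DNA string whose length is a multiple of three."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def recode(dna, usage):
    """Replace each codon with the host-preferred synonym; the protein is unchanged."""
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    return "".join(preferred_codon(CODON_TABLE[c], usage) for c in codons)


original = "ATGTTAAAATGG"            # toy sequence encoding M-L-K-W
optimized = recode(original, HYPOTHETICAL_USAGE)
print(optimized, translate(original) == translate(optimized))
```

Always choosing the single most frequent codon is the simplest policy; it is used here only to keep the sketch short.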

As noted supra, the present invention provides nucleic acids having
flanking (5' or 3') and intronic sequences of the hTRT gene. The nucleic acids are of
interest, inter alia, because they contain promoter and other regulatory elements
involved in hTRT regulation and useful for expression of hTRT and other recombinant
proteins or RNA gene products. It will be apparent that, in addition to the nucleic acid
sequences provided in SEQUENCE ID NOS: 6 and 7, additional hTRT intron and
flanking sequences may be readily obtained using routine molecular biological
techniques. For example, additional hTRT genomic sequence may be obtained from
Lambda clone GΦ5 (ATCC Accession No. 209024), described supra and in Example 4.
Still other hTRT genomic clones and sequences may be obtained by screening a human
genomic library using an hTRT nucleic acid probe having a sequence or subsequence
from SEQUENCE ID NO: 1. Additional clones and sequences (e.g., still further
upstream) may be obtained by using labeled sequences or subclones derived from
λGΦ5 to probe appropriate libraries. Other useful methods for further characterization
of hTRT flanking sequences include those general methods described by Gobinda et al.,
1993, PCR Meth Applic. 2:318; Triglia et al., 1988, Nucleic Acids Res. 16:8186;
Lagerstrom et al., 1991, PCR Methods Applic. 1:111; and Parker et al., 1991, Nucleic
Acids Res. 19:3055.

Intronic sequences can be identified by routine means such as by
comparing the hTRT genomic sequence with hTRT cDNA sequences (see, e.g.,
Example 3), by S1 analysis (see Ausubel et al., supra, at Chapter 4), or various other
means known in the art. Intronic sequences can also be found in pre-mRNA (i.e.,
unspliced or incompletely spliced mRNA precursors), which may be amplified or
cloned following reverse transcription of cellular RNA.
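
The genomic-versus-cDNA comparison mentioned above can be sketched for the simplest case of a single intron. The sequences below are invented toy strings, and the greedy prefix match is a simplification assumed for illustration; a practical analysis would use a spliced-alignment method and splice-site consensus rules.

```python
def find_single_intron(genomic, cdna):
    """Locate one intron by matching the cDNA to the genomic 5' and 3' ends.

    Returns (intron_start_index, intron_sequence). The 5' boundary is taken
    greedily as the longest shared prefix, so it can be off by a few bases
    when the intron happens to begin like the downstream exon.
    """
    prefix = 0
    while (prefix < len(cdna) and prefix < len(genomic)
           and genomic[prefix] == cdna[prefix]):
        prefix += 1
    remaining = len(cdna) - prefix          # cDNA bases still to be matched
    end = len(genomic) - remaining          # where the downstream exon must start
    if end < prefix or genomic[end:] != cdna[prefix:]:
        raise ValueError("sequences are not consistent with a single intron")
    return prefix, genomic[prefix:end]


# Invented toy sequences: two short exons flanking a 12-base intron.
exon1, intron, exon2 = "ATGGCC", "GTAAGTTTTCAG", "TTAGGG"
start, found = find_single_intron(exon1 + intron + exon2, exon1 + exon2)
print(start, found)
```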

When desired, the sequence of the cloned, amplified, or otherwise
synthesized hTRT or other TRT nucleic acid can be determined or verified using DNA
sequencing methods well known in the art (see, e.g., Ausubel et al., supra). Useful
methods of sequencing employ such enzymes as the Klenow fragment of DNA
polymerase I, Sequenase (US Biochemical Corp, Cleveland OH), Taq DNA polymerase
(Perkin Elmer, Norwalk CT), thermostable T7 polymerase (Amersham, Chicago IL), or
combinations of recombinant polymerases and proofreading exonucleases such as the
ELONGASE Amplification System marketed by Gibco BRL (Gaithersburg MD).
When sequencing or verifying the sequence of oligonucleotides (such as
oligonucleotides made de novo by chemical synthesis), the method of Maxam and
Gilbert may be preferred (Maxam and Gilbert, 1980, Meth Enz. 65:499; Ausubel et al.,
supra, Ch. 7).

The 5' untranslated sequences of hTRT or other TRT mRNAs can be
determined directly by cloning a "full-length" hTRT or other cDNA using standard
methods such as reverse transcription of mRNA, followed by cloning and sequencing
the resulting cDNA. For screening or amplifying full length cDNAs, oligo(dT)-primed
libraries that have been size-selected to include larger cDNAs may be
preferred. Random primed libraries are also suitable and often include a larger
proportion of clones that contain the 5' regions of genes. Other well known methods
for obtaining 5' RNA sequences, such as the RACE protocol described by Frohman et
al., 1988, Proc. Nat. Acad. Sci. USA 85:8998, may also be used. If desired, the
transcription start site of an hTRT or other TRT mRNA can be determined by routine
methods using the nucleic acids provided herein (e.g., having a sequence of
SEQUENCE ID NO: 1). One method is S1 nuclease analysis (Ausubel et al., supra)
using a labeled DNA having a sequence from the 5' region of SEQUENCE ID NO: 1.

2) CHEMICAL SYNTHESIS OF NUCLEIC ACIDS 

The present invention also provides hTRT polynucleotides (RNA, DNA 
or modified) that are produced by direct chemical synthesis. Chemical synthesis is 
generally preferred for the production of oligonucleotides or for oligonucleotides and 
polynucleotides containing nonstandard nucleotides (e.g., probes, primers and antisense 
oligonucleotides). Direct chemical synthesis of nucleic acids can be accomplished by 
methods known in the art, such as the phosphotriester method of Narang et al., 1979,
Meth Enzymol 68:90; the phosphodiester method of Brown et al., Meth Enzymol
68:109 (1979); the diethylphosphoramidite method of Beaucage et al., Tetra. Lett.,
22:1859 (1981); and the solid support method of U.S. Patent No. 4,458,066. Chemical 
synthesis typically produces a single stranded oligonucleotide, which may be converted 
into double stranded DNA by hybridization with a complementary sequence, or by 
polymerization with a DNA polymerase and an oligonucleotide primer using the single 
strand as a template. One of skill will recognize that while chemical synthesis of DNA 
is often limited to sequences of about 100 or 150 bases, longer sequences may be 
obtained by the ligation of shorter sequences or by more elaborate synthetic methods. 

It will be appreciated that the hTRT (or hTR or other) polynucleotides 
and oligonucleotides of the invention can be made using nonstandard bases (e.g., other 
than adenine, cytidine, guanine, thymine, and uridine) or nonstandard backbone 
structures to provide desirable properties (e.g., increased nuclease-resistance,
tighter-binding, stability or a desired Tm). Techniques for rendering oligonucleotides 
nuclease-resistant include those described in PCT publication WO 94/12633. A wide 
variety of useful modified oligonucleotides may be produced, including 
oligonucleotides having a peptide-nucleic acid (PNA) backbone (Nielsen et al., 1991, 
Science 254:1497) or incorporating 2'-O-methyl ribonucleotides, phosphorothioate
nucleotides, methyl phosphonate nucleotides, phosphotriester nucleotides,
phosphorothioate nucleotides, phosphoramidates. Still other useful oligonucleotides
may contain alkyl and halogen-substituted sugar moieties comprising one of the
following at the 2' position: OH, SH, SCH3, F, OCN, OCH3OCH3, OCH3O(CH2)nCH3,
O(CH2)nNH2 or O(CH2)nCH3 where n is from 1 to about 10; C1 to C10 lower alkyl,
substituted lower alkyl, alkaryl or aralkyl; Cl; Br; CN; CF3; OCF3; O-, S-, or N-alkyl;
O-, S-, or N-alkenyl; SOCH3; SO2CH3; ONO2; NO2; N3; NH2; heterocycloalkyl;
heterocycloalkaryl; aminoalkylamino; polyalkylamino; substituted silyl; an RNA 
cleaving group; a cholesteryl group; a folate group; a reporter group; an intercalator; a 
group for improving the pharmacokinetic properties of an oligonucleotide; or a group 
for improving the pharmacodynamic properties of an oligonucleotide and other 
substituents having similar properties. Folate, cholesterol or other groups which 
facilitate oligonucleotide uptake, such as lipid analogs, may be conjugated directly or
via a linker at the 2' position of any nucleoside or at the 3' or 5' position of the
3'-terminal or 5'-terminal nucleoside, respectively. One or more such conjugates may be
used. Oligonucleotides may also have sugar mimetics such as cyclobutyls in place of
the pentofuranosyl group. Other embodiments may include at least one modified base
form or "universal base" such as inosine, or inclusion of other nonstandard bases such
as queuosine and wybutosine as well as acetyl-, methyl-, thio- and similarly modified
forms of adenine, cytidine, guanine, thymine, and uridine which are not as easily
recognized by endogenous endonucleases. The invention further provides
oligonucleotides having backbone analogues such as phosphodiester, phosphorothioate,
phosphorodithioate, methylphosphonate, phosphoramidate, alkyl phosphotriester,
sulfamate, 3'-thioacetal, methylene(methylimino), 3'-N-carbamate, morpholino
carbamate, chiral-methyl phosphonates, nucleotides with short chain alkyl or cycloalkyl
intersugar linkages, short chain heteroatomic or heterocyclic intersugar ("backbone")
linkages, or CH2-NH-O-CH2, CH2-N(CH3)-OCH2, CH2-O-N(CH3)-CH2,
CH2-N(CH3)-N(CH3)-CH2 and O-N(CH3)-CH2-CH2 backbones (where phosphodiester
is O-P-O-CH2), or mixtures of the same. Also useful are oligonucleotides having
morpholino backbone structures (U.S. Patent No. 5,034,506).

Useful references include Oligonucleotides and Analogues, A Practical
Approach, edited by F. Eckstein, IRL Press at Oxford University Press (1991);
Antisense Strategies, Annals of the New York Academy of Sciences, Volume 600, Eds.
Baserga and Denhardt (NYAS 1992); Milligan et al., 9 July 1993, J. Med. Chem.
36(14):1923-1937; Antisense Research and Applications (1993, CRC Press), in its
entirety and specifically Chapter 15, by Sanghvi, entitled "Heterocyclic base
modifications in nucleic acids and their applications in antisense oligonucleotides";
and Antisense Therapeutics, ed. Sudhir Agrawal (Humana Press, Totowa, New Jersey,
1996).

D) LABELING NUCLEIC ACIDS

It is often useful to label the nucleic acids of the invention, for example, 
when the hTRT or other oligonucleotides or polynucleotides are to be used as nucleic 
acid probes. The labels (see infra) may be incorporated by any of a number of means 
well known to those of skill in the art. In one embodiment, an unamplified nucleic 
acid (e.g., mRNA, polyA mRNA, cDNA) is labeled. Means of producing labeled 
nucleic acids are well known to those of skill in the art and include, for example, nick- 
translation, random primer labeling, end-labeling (e.g. using a kinase), and chemical 
conjugation (e.g., photobiotinylation) or synthesis. In another embodiment, the label is 
simultaneously incorporated during an amplification step in the preparation of the 
sample nucleic acids. Thus, for example, polymerase chain reaction (PCR) or other 
nucleic acid amplification method with labeled primers or labeled nucleotides will 
provide a labeled amplification product. In another embodiment, transcription 
amplification using a labeled nucleotide (e.g. fluorescein-labeled UTP and/or CTP) 
incorporates a label into the transcribed nucleic acids. An amplification product may 
also, or alternatively, be labeled after the amplification is completed. 

E) ILLUSTRATIVE OLIGONUCLEOTIDES 

As noted supra and discussed in detail infra, oligonucleotides are used 
for a variety of uses including as primers, probes, therapeutic or other antisense 
oligonucleotides, triplex oligonucleotides, and numerous other uses as apparent from 
this disclosure. Table 2 provides certain illustrative specific oligonucleotides that may 
be used in the practice of the invention. It will be appreciated that numerous other 
useful oligonucleotides of the invention may be synthesized by one of skill, following 
the guidance provided herein. 

In Table 2, "seq" means that the primer has been used, or is useful, for
sequencing; "PCR" means that the primer has been used, or is useful, for PCR; "AS"
means that the primer has been used, or is useful, for antisense inhibition of
telomerase activity; "CL" means that the primer has been used, or is useful, in cloning
regions of hTRT genes or RNA; "mut" means that the primer has been used, or is
useful, for constructing mutants of hTRT genes or gene products. "UC" means "upper
case," and "lc" means "lower case." Mismatches and insertions (relative to
SEQUENCE ID NO: 1) are indicated by underlining; deletions are indicated by a
It will be appreciated that nothing in Table 2 is intended to limit the use of any
particular oligonucleotide to any single use or set of uses.

IV. TRT PROTEINS AND PEPTIDES
A) GENERALLY 

The invention provides a wide variety of hTRT proteins useful for, inter
alia, production of telomerase activity, inhibition of telomerase activity in a cell,
induction of an anti-hTRT immune response, as a therapeutic reagent, as a standard or
control in a diagnostic assay, as a target in a screen for compounds capable of activation
or inhibition of an activity of hTRT or telomerase, and numerous other uses that will be
apparent to one of skill or are otherwise described herein. The hTRT proteins of the
invention include functionally active proteins (useful for, e.g., conferring telomerase
activity in a telomerase-negative cell) and variants, inactive variants (useful for, e.g.,
inhibiting telomerase activity in a cell), hTRT polypeptides, and telomerase RNPs (e.g.,
ribonucleoprotein complexes comprising the proteins) that exhibit one, several, or all of
the functional activities of naturally occurring hTRT and telomerase, as discussed in
greater detail for illustrative purposes, below.

In one embodiment, the hTRT protein of the invention is a polypeptide
having a sequence as set forth in Figure 17 (SEQUENCE ID NO: 2), or a fragment
thereof. In another embodiment, the hTRT polypeptide differs from SEQUENCE ID
NO: 2 by internal deletions, insertions, or conservative substitutions of amino acid
residues. In a related embodiment, the invention provides hTRT polypeptides with
substantial similarity to SEQUENCE ID NO: 2. The invention further provides hTRT
polypeptides that are modified, relative to the amino acid sequence of SEQUENCE ID
NO: 2, in some manner, e.g., truncated, mutated, derivatized, or fused to other
sequences (e.g., to form a fusion protein). Moreover, the present invention provides
telomerase RNPs comprising an hTRT protein of the invention complexed with a
template RNA (e.g., hTR). In other embodiments, one or more telomerase-associated
proteins is associated with hTRT protein and/or hTR.

The invention also provides other naturally occurring hTRT species or
nonnaturally occurring variants, such as proteins having the sequence of, or substantial
similarity to, SEQUENCE ID NO: 5 [Figure 20], SEQUENCE ID NO: 10 [Figure 19],
and fragments, variants, or derivatives thereof.

The invention provides still other hTRT species and variants. One
example of an hTRT variant may result from ribosome frameshifting of mRNA
encoded by the clone 712562 (SEQUENCE ID NO: 3 [Figure 18]) or the pro90 variant 
hTRT shown in SEQUENCE ID NO: 4 [Figure 20] and so result in the synthesis of 
hTRT polypeptides containing all the TRT motifs (for a general example, see, e.g., 
Tsuchihashi et al., 1990, Proc. Natl Acad. Sci. USA 87:2516; Craigengen et al., 1987, 
Cell 50:1; Weiss, 1990, Cell 62:117). Ribosome frameshifting can occur when specific
mRNA sequences or secondary structures cause the ribosome to "stall" and jump one 
nucleotide forwards or back in the sequence. Thus, a ribosome frameshift event on the 
712562 mRNA could cause the synthesis of an approximately 523 amino acid residue 
polypeptide. A ribosome frameshift event on the pro90 sequence could result in a 
protein with approximately 1071 residues. It will be appreciated that proteins resulting 
from ribosome frameshifting can also be expressed by synthetic or recombinant 
techniques provided by the invention. 
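
The effect of a one-nucleotide ribosome slip, as described above, can be illustrated on a toy sequence. The sketch assumes the standard genetic code; the sequence, the slip position, and the function names are invented for illustration and are unrelated to the 712562 or pro90 sequences.

```python
from itertools import product

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ("*" = stop).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(seq, start=0):
    """Translate from `start` until a stop codon or the end of the sequence."""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def translate_with_frameshift(seq, slip_index, shift):
    """Translate in frame up to `slip_index`, then resume `shift` bases away.

    `slip_index` is a nucleotide index on a codon boundary; `shift` is +1 for
    a forward slip and -1 for a backward slip.
    """
    return translate(seq[:slip_index]) + translate(seq, slip_index + shift)


toy = "ATGAAATAGGGCTGGTAA"   # in frame: ATG AAA TAG, so translation stops early
print(translate(toy))
print(translate_with_frameshift(toy, 6, +1))
print(translate_with_frameshift(toy, 6, -1))
```

In this toy case the slip lets translation bypass the in-frame stop codon and yield a longer product, which is the same qualitative outcome the text describes for the frameshifted variants.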

Human TRT proteins, peptides, and functionally equivalent proteins 
may be obtained by purification, chemical synthesis, or recombinant production, as 
discussed in greater detail below. 

B) TRT PROTEIN ACTIVITIES 

The TRT polypeptides of the invention (including fragments, variants, 
products of alternative alleles, and fusion proteins) can have one or more, or all of the 
functional activities associated with native hTRT. Except as noted, as used herein, an 
hTRT or other TRT polypeptide is considered to have a specified activity if the activity 
is exhibited by either the hTRT protein without an associated RNA (e.g., hTR) or in an 
hTRT-associated RNA (e.g., hTR) complex. The hTR-binding activity of hTRT is one 
example of an activity associated with the hTRT protein. Methods for producing 
complexes of nucleic acids (e.g., hTR) and the hTRT polypeptides of the invention are 
described infra. 

Modification of the hTRT protein (e.g., by chemical or recombinant 
means, including mutation or modification of a polynucleotide encoding the hTRT 
polypeptide or chemical synthesis of a polynucleotide that has a sequence different than 
a native polynucleotide sequence) to have a different complement of activities than
native hTRT can be useful in therapeutic applications or in screening for specific
modulators of hTRT or telomerase activity. In addition, assays for various hTRT 
activities can be particularly useful for identification of agents (e.g., activity modulating 
agents) that interact with hTRT or telomerase to change telomerase activity. 

The activities of native hTRT, as discussed infra, include telomerase
catalytic activity (which may be either processive or non-processive activity);
telomerase processivity; conventional reverse transcriptase activity; nucleolytic
activity; primer or substrate (telomere or synthetic telomerase substrate or primer)
binding activity; dNTP binding activity; RNA (i.e., hTR) binding activity; and protein
binding activity (e.g., binding to telomerase-associated proteins, telomere-binding
proteins, or to a protein-telomeric DNA complex). It will be understood, however, that
the present invention also provides hTRT compositions without any particular hTRT
activity but with some useful activity related to the hTRT or other TRT proteins (e.g.,
certain typically short immunogenic peptides, inhibitory peptides).

1) TELOMERASE CATALYTIC ACTIVITY 

As used herein, a polypeptide of the invention has "telomerase catalytic
activity," when the polypeptide is capable of extending a DNA primer that functions as
a telomerase substrate by adding a partial, one, or more than one repeat of a sequence
(e.g., TTAGGG) encoded by a template nucleic acid (e.g., hTR). This activity may be
processive or nonprocessive. Processive activity occurs when a telomerase RNP adds
multiple repeats to a primer or telomere before the DNA is released by the enzyme
complex. Non-processive activity occurs when telomerase adds a partial, or only one,
repeat to a primer and is then released. In vivo, however, a non-processive reaction
could add multiple repeats by successive rounds of association, extension, and
dissociation. This can occur in vitro as well, but it is not typically observed in standard
assays due to the vast molar excess of primer over telomerase in standard assay
conditions.

To characterize an hTRT polypeptide as having non-processive activity, a
conventional telomerase reaction is performed using conditions that favor a
non-processive reaction, for example high temperatures (i.e., 35-40 °C, typically 37 °C),
low dGTP concentrations (1 µM or less), high primer concentrations (5 µM or higher),
and high dATP/TTP concentrations (2 mM or higher), with the temperature and dGTP
concentration typically having the greatest effect. To characterize an hTRT polypeptide
as having processive activity, a conventional telomerase reaction is performed using
conditions that favor a processive reaction, for example 27-34 °C (typically 30 °C), high
dGTP concentration (10 µM or higher), low primer concentration (1 µM or lower), and/or
low dATP and TTP concentrations (0.3-1 mM), with temperature and dGTP concentration
typically being the most critical. Alternatively, a TRAP assay (for processive or
moderately processive activity) or the dot-blot and gel-blot assays (for processive
activity) may be used. The hTRT polypeptide of the invention can possess a
non-processive activity, but not a processive activity (e.g., if an alteration of the hTRT
polypeptide reduces or eliminates the ability to translocate), can be solely processive, or
can possess both activities.
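
The condition ranges given above can be collected into a small lookup, sketched here in Python. This is only an illustrative sketch: the class name `ReactionConditions`, the function `favored_mode`, and the equal-weight vote across the four parameters are hypothetical conveniences and are not part of the described assays (the text notes that temperature and dGTP concentration typically matter most, which this simple vote does not model).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReactionConditions:
    """Conditions for a conventional telomerase reaction (hypothetical container)."""
    temperature_c: float   # reaction temperature, degrees C
    dgtp_um: float         # dGTP concentration, micromolar
    primer_um: float       # primer concentration, micromolar
    datp_ttp_mm: float     # dATP and TTP concentration, millimolar


def favored_mode(c: ReactionConditions) -> str:
    """Report which reaction mode the conditions favor.

    The thresholds are the ranges quoted in the text. Counting each criterion
    as one equal vote is an assumption made only for this sketch.
    """
    non_processive_votes = sum([
        35 <= c.temperature_c <= 40,   # high temperature
        c.dgtp_um <= 1,                # low dGTP
        c.primer_um >= 5,              # high primer
        c.datp_ttp_mm >= 2,            # high dATP/TTP
    ])
    processive_votes = sum([
        27 <= c.temperature_c <= 34,   # lower temperature
        c.dgtp_um >= 10,               # high dGTP
        c.primer_um <= 1,              # low primer
        0.3 <= c.datp_ttp_mm <= 1,     # low dATP/TTP
    ])
    if non_processive_votes > processive_votes:
        return "non-processive"
    if processive_votes > non_processive_votes:
        return "processive"
    return "indeterminate"


if __name__ == "__main__":
    print(favored_mode(ReactionConditions(37, 1, 5, 2)))
    print(favored_mode(ReactionConditions(30, 10, 1, 0.5)))
```

Conditions that satisfy criteria from both lists equally are reported as indeterminate rather than forced into either category.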

a) Non-processive Activity 

A non-processive telomerase catalytic activity can extend the DNA
primer from the position where the 3' end anneals to the RNA template to the 5' end of
the template sequence, typically terminating with the addition of the first G residue (as,
for example, when the template is hTR). As shown below, the exact number of
nucleotides added is dependent on the position of the 3' terminal nucleotide of the
primer in the TTAGGG repeat sequence.

NONPROCESSIVE ACTIVITY

i)      TTAGGGttag        (DNA)
     3'  AUCCCAAUC  5'    (RNA)

ii)     TTAGggttag        (DNA)
     3'  AUCCCAAUC  5'    (RNA)

In DNA, UC = primer, lc = added nucleotides

Thus, 4 nucleotides are added to the -TTAGGG primer (i) while 6
nucleotides are added to the -TTAG primer (ii). The first repeat added by telomerase
in a processive reaction is equivalent to this step; however, in a processive reaction
telomerase performs a translocation step where the 3' end is released and re-bound at
the 3' region of the template in a position sufficient to prime addition of another repeat
(see Morin, 1997, Eur. J. Cancer 33:750).

A fully non-processive reaction produces only one band in a
conventional assay using a single synthetic primer. Because this result could also be
produced by other enzymes, such as a terminal transferase activity, it may be desirable
in some applications to verify that the product is a result of a telomerase catalytic
activity. A band generated by telomerase (comprising hTRT) can be distinguished by
several additional characteristics. The number of nucleotides added to the end of the
primer should be consistent with the position of the primer 3' end. Thus, a -TTAGGG
primer should have 4 nucleotides added and a -TTAG primer should have 6
nucleotides added (see above). In practice, two or more sequence permuted primers can
be used which have the same overall length but different 5' and 3' endpoints. As an
illustrative example, the non-processive extension of primers
5'-TTAGGGTTAGGGTTAGGG and 5'-GTTAGGGTTAGGGTTAGG will generate
products whose absolute length will be one nucleotide different (4 added to
5'-TTAGGGTTAGGGTTAGGG for a 22 nt total length, and 5 added to
5'-GTTAGGGTTAGGGTTAGG for a 23 nt total length). The nucleotide dependence of
the reaction should be consistent with the position of the primer terminus. Thus, a
-TTAGGG primer product should require dGTP, TTP, and dATP, but not dCTP, and a
-AGGGTT primer product should require dGTP and dATP, but not TTP or dCTP.
The activity should be sensitive to RNase or micrococcal nuclease pre-treatment (see
Morin, 1989, Cell 59:521) under conditions that will degrade hTR and so eliminate the
template.
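
The expected product length and nucleotide requirement for a given primer can be worked out mechanically from the position of its 3' terminus in the TTAGGG repeat. The following Python sketch illustrates that arithmetic under the assumption, taken from the text, that extension on an hTR template stops after the first G of the repeat. The function names (`repeat_phase`, `nonprocessive_addition`, `required_dntps`) are hypothetical and chosen only for this illustration.

```python
REPEAT = "TTAGGG"      # human telomeric repeat, written 5'->3'
BOUNDARY_INDEX = 3     # index of the first G in TTAGGG, where extension stops
DNTP = {"A": "dATP", "C": "dCTP", "G": "dGTP", "T": "TTP"}


def repeat_phase(primer: str) -> int:
    """Return the index within TTAGGG of the primer's 3'-terminal nucleotide."""
    tail = primer.upper()[-len(REPEAT):]
    n = len(tail)
    # Try each possible phase p and keep those whose cyclic repeat matches the tail.
    matches = [
        p for p in range(len(REPEAT))
        if all(tail[i] == REPEAT[(p - (n - 1 - i)) % len(REPEAT)] for i in range(n))
    ]
    if len(matches) != 1:
        raise ValueError(f"3' end of {primer!r} is not an unambiguous telomeric permutation")
    return matches[0]


def nonprocessive_addition(primer: str) -> str:
    """Nucleotides (lowercase) added in one non-processive round, ending at the first G."""
    phase = repeat_phase(primer)
    # At least one nucleotide is added, so a primer already ending at the
    # boundary (e.g., -TTAG) receives a full six-nucleotide repeat.
    count = (BOUNDARY_INDEX - phase - 1) % len(REPEAT) + 1
    return "".join(REPEAT[(phase + k) % len(REPEAT)] for k in range(1, count + 1)).lower()


def required_dntps(primer: str) -> set:
    """dNTPs needed to synthesize the non-processive product of this primer."""
    return {DNTP[base] for base in nonprocessive_addition(primer).upper()}


if __name__ == "__main__":
    for primer in ("TTAGGGTTAGGGTTAGGG", "GTTAGGGTTAGGGTTAGG", "TTAG", "AGGGTT"):
        added = nonprocessive_addition(primer)
        print(primer, added, len(primer) + len(added), sorted(required_dntps(primer)))
```

For the two sequence-permuted 18-nucleotide primers discussed above, this calculation gives additions of 4 and 5 nucleotides (22 nt and 23 nt products), and for an -AGGGTT primer it gives a requirement for dATP and dGTP only, in agreement with the text.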

b) Processive Activity

In practice, a processive activity is easily observed by the
appearance of a six nucleotide ladder in a conventional assay, TRAP assay, or gel-blot
assay. A dot-blot assay can also be used, but no ladder is detected in such a method.
The conventional assay is described in Morin, 1989, Cell 59:521, which is incorporated
herein in its entirety and for all purposes. The TRAP assay is described in U.S. Patent
No. 5,629,154; see also, PCT publication WO 97/15687, PCT publication WO
95/13381; Krupp et al., 1997, Nucleic Acids Res. 25:919; and Wright et al., 1995, Nuc.
Acids Res. 23:3794, each of which is incorporated herein in its entirety and for all
purposes. The dot blot immunoassay is described in detail in co-pending U.S. Patent
Application Serial Number 08/833,377, filed April 14, 1997, which is incorporated
herein by reference in its entirety and for all purposes. The dot blot assay can be used
in a format in which a non-processive activity, which does not add the 3 or more
repeats required for stable hybridization of the (CCCUAA)n probe used to detect the
activity, is tested with compounds or hTRT variants to determine whether the compound
or variant generates processivity; i.e., if the probe detects an expected telomerase
substrate, then the compound or mutant is able to change the non-processive activity to
a processive activity. Other assays for processive telomerase catalytic activity can also
be used, e.g., the stretch PCR assay of Tatematsu et al., 1996, Oncogene 13:2265. The
gel-blot assay, a combination of the conventional and dot blot assays, can also be used.
In this variation a conventional assay is performed with no radiolabeled nucleotide and
with high dGTP concentrations (e.g., 0.1-2 mM). After performing the conventional assay,
the synthesized DNA is separated by denaturing PAGE and transferred to a membrane
(e.g., nitrocellulose). Telomeric DNA (the product of telomerase, i.e., an extended
telomerase primer or substrate) can then be detected by methods such as hybridization
using labeled telomeric DNA probes (e.g., probes containing the CCCTAA sequence,
as used in the dot blot assay, supra). An advantage of this technique is that it is more
sensitive than the conventional assay and provides information about the size of the
synthesized fragments and processivity of the reaction.

c) Activity determinations 

The telomerase activity of an hTRT polypeptide can be determined
using an unpurified, partially purified or substantially purified hTRT polypeptide (e.g.,
in association with hTR), in vitro, or after expression in vivo. For example, telomerase
activity in a cell (e.g., a cell expressing a recombinant hTRT polypeptide of the
invention) can be assayed by detecting an increase or decrease in the length of
telomeres. Typically assays for telomerase catalytic activity are carried out using an
hTRT complexed with hTR; however, alternative telomerase template RNAs may be
substituted, or one may conduct assays to measure another activity, such as
telomerase-primer binding. Assays to determine the length of telomeres are known in the
art and include hybridization of probes to telomeric DNA (an amplification step can be
included) and TRF analysis, i.e., the analysis of telomeric DNA restriction fragments
[TRFs] following restriction endonuclease digestion; see PCT publications WO
93/23572 and WO 96/41016; Counter et al., 1992, EMBO J. 11:1921; Allsopp et al.,
1992, Proc. Natl. Acad. Sci. USA 89:10114; Sanno, 1996, Am. J. Clin. Pathol. 106:16;
and Sanno, 1997, Neuroendocrinology 65:299.

The telomerase catalytic activity of an hTRT polypeptide may be
determined in a number of ways using the assays supra and other telomerase catalytic
activity assays. According to one method, the hTRT protein is expressed (e.g., as
described infra) in a telomerase negative human cell in which hTR is expressed (i.e.,
either normally in the cell or through recombinant expression), and the presence or
absence of telomerase activity in the cell or cell lysate is determined. Examples of
suitable telomerase-negative cells are IMR 90 (ATCC, #CCL-186) or BJ cells (human
foreskin fibroblast line; see, e.g., Feng et al., 1995, Science 269:1236). Other
examples include retinal pigmented epithelial cells (RPE), human umbilical vein
endothelial cells (HUVEC; ATCC #CRL-1730), human aortic endothelial cells (HAEC;
Clonetics Corp, #CC-2535), and human mammary epithelial cells (HME; Hammond et
al., 1984, Proc. Natl. Acad. Sci. USA 81:5435; Stampfer, 1985, J. Tissue Culture
Methods 9:107). In an alternative embodiment, the hTRT polypeptide is expressed
(e.g., by transfection with an hTRT expression vector) in a telomerase positive cell, and
an increase in telomerase activity in the cell compared to an untransfected control cell is
detected if the polypeptide has telomerase catalytic activity. Usually the telomerase
catalytic activity in a cell transfected with a suitable expression vector expressing hTRT
will be significantly increased, such as at least about 2-fold, at least about 5-fold, or
even at least about 10-fold to 100-fold or even 1000-fold higher than in untransfected
(control) cells.

In an alternative embodiment, the hTRT protein is expressed in a cell
(e.g., a telomerase negative cell in which hTR is expressed) as a fusion protein (see
infra) having a label or an "epitope tag" to aid in purification. In one embodiment, the
RNP is recovered from the cell using an antibody that specifically recognizes the tag.
Preferred tags are typically short or small and may include a cleavage site or other
property that allows the tag to be removed from the hTRT polypeptide. Examples of
suitable tags include the Xpress™ epitope (Invitrogen, Inc., San Diego CA), and other
moieties that can be specifically bound by an antibody or nucleic acid or other
equivalent method such as those described in Example 6. Alternative tags include
those encoded by sequences inserted, e.g., into SEQUENCE ID NO: 1 upstream of the
ATG codon that initiates translation of the protein of SEQUENCE ID NO: 2, which
may include insertion of a (new) methionine initiation codon into the upstream
sequence.

It will be appreciated that when an hTRT variant is expressed in a cell
(e.g., as a fusion protein) and subsequently isolated (e.g., as a ribonucleoprotein
complex), other cell proteins (i.e., telomerase-associated proteins) may be associated
with (directly or indirectly bound to) the isolated complex. In such cases, it will
sometimes be desirable to assay telomerase activity for the complex containing hTRT,
hTR and the associated proteins.

2) OTHER TELOMERASE OR TRT PROTEIN ACTIVITIES 

The hTRT polypeptides of the invention include variants that lack
telomerase catalytic activity but retain one or more other activities of telomerase.
These other activities and the methods of the invention for measuring such activities
include (but are not limited to) those discussed in the following sections.

a) Conventional reverse transcriptase activity 

Telomerase conventional reverse transcriptase activity is described in,
e.g., Morin, 1997, supra, and Spence et al., 1995, Science 267:988. Because hTRT
contains conserved amino acid motifs that are required for reverse transcriptase
catalytic activity, hTRT has the ability to transcribe certain exogenous (e.g., non-hTR)
RNAs. A conventional RT assay measures the ability of the enzyme to transcribe an
RNA template by extending an annealed DNA primer. Reverse transcriptase activity
can be measured in numerous ways known in the art, for example, by monitoring the
size increase of a labeled nucleic acid primer (e.g., RNA or DNA), or incorporation of a
labeled dNTP. See, e.g., Ausubel et al., supra.

Because hTRT specifically associates with hTR, it can be appreciated
that the DNA primer/RNA template for a conventional RT assay can be modified to
have characteristics related to hTR and/or a telomeric DNA primer. For example, the
RNA can have the sequence (CCCTAA)n, where n is at least 1, or at least 3, or at least
10 or more. In one embodiment, the (CCCTAA)n region is at or near the 5' terminus of
the RNA (similar to the 5' locations of template regions in telomerase RNAs).
Similarly, the DNA primer may have a 3' terminus that contains portions of the
TTAGGG telomere sequence, for example XnTTAG, XnAGGG, Xn(TTAGGG)qTTAG,
etc., where X is a non-telomeric sequence and n is 8-20, or 6-30, and q is 1-4. In
another embodiment, the DNA primer has a 5' terminus that is non-complementary to
the RNA template, such that when the primer is annealed to the RNA, the 5' terminus of
the primer remains unbound. Additional modifications of standard reverse transcription
assays that may be applied to the methods of the invention are known in the art.
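
As an illustration of the primer and template formats just described, the following Python sketch assembles an RNA carrying the repeat region at its 5' terminus and a DNA primer of the form Xn(TTAGGG)qTTAG, and checks that the telomeric 3' portion of the primer can base-pair with the template. The helper names (`rt_template`, `rt_primer`, `dna_to_complementary_rna`) and the example non-telomeric sequence X are hypothetical. Writing the RNA repeat as CCCUAA (U in place of T) is an assumption about how the (CCCTAA)n notation is to be read for an RNA molecule.

```python
# Watson-Crick partner, in RNA letters, for each DNA base.
_COMPLEMENT = {"A": "U", "C": "G", "G": "C", "T": "A"}
_TELOMERIC = "TTAGGGTTAGGG"   # two repeats, used to validate primer tails


def rt_template(n_repeats: int, spacer: str = "") -> str:
    """RNA (5'->3') with the repeat region at its 5' terminus, then an optional spacer."""
    if n_repeats < 1:
        raise ValueError("n_repeats must be at least 1")
    return "CCCUAA" * n_repeats + spacer


def rt_primer(x: str, q: int = 0, tail: str = "TTAG") -> str:
    """DNA primer (5'->3') of the form X_n (TTAGGG)_q <tail>.

    x    : non-telomeric 5' sequence; length limited to the broader 6-30 range
    q    : number of full repeats (0 gives the X_n TTAG or X_n AGGG forms)
    tail : a portion of the TTAGGG sequence forming the 3' terminus
    """
    if not 6 <= len(x) <= 30:
        raise ValueError("X should be 6-30 nucleotides long")
    if not 0 <= q <= 4:
        raise ValueError("q should be 0-4")
    if not tail or tail not in _TELOMERIC:
        raise ValueError("tail must be a portion of the TTAGGG repeat sequence")
    return x + "TTAGGG" * q + tail


def dna_to_complementary_rna(dna: str) -> str:
    """RNA strand (5'->3') that base-pairs with the given DNA strand (5'->3')."""
    return "".join(_COMPLEMENT[base] for base in reversed(dna.upper()))


if __name__ == "__main__":
    template = rt_template(3)
    primer = rt_primer("ACGTACGTAC", q=1)     # X is an arbitrary example sequence
    telomeric_part = primer[len("ACGTACGTAC"):]
    print(template)
    print(primer)
    print(dna_to_complementary_rna(telomeric_part) in template)
```

Because only the telomeric portion is tested for complementarity, the non-telomeric X segment plays the role of the unbound 5' terminus mentioned above.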

b) Nucleolytic activity

Telomerase nucleolytic activity is described in e.g., Morin, 1997, supra;
Collins and Greider, 1993, Genes and Development 7:1364. Telomerase possesses a
nucleolytic activity (Joyce and Steitz, 1987, Trends Biochem. Sci. 12:288); however,
telomerase activity has defining characteristics. Telomerase preferentially removes
nucleotides, usually only one, from the 3' end of an oligonucleotide when the 3' end of
the DNA is positioned at the 5' boundary of the DNA template sequence. In humans
and Tetrahymena, this nucleotide is the first G of the telomeric repeat (TTAGGG in
humans). Telomerase preferentially removes G residues but has nucleolytic activity
against other nucleotides. This activity can be monitored. Two different methods are
described here for illustrative purposes. One method involves a conventional
telomerase reaction with a primer that binds the entire template sequence (i.e.,
terminating at the template boundary; 5'-TAGGGATTAG in humans). Nucleolytic
activity is observed by monitoring the replacement of the last dG residue with a
radiolabeled dGTP provided in the assay. The replacement is monitored by the
appearance of a band at the size of the starting primer as shown by gel electrophoresis
and autoradiography.

A preferred method uses a DNA primer that has a "blocked" 3' terminus that
cannot be extended by telomerase. The 3'-blocked primer can be used in a standard
telomerase assay but will not be extended unless the 3' nucleotide is removed by the
nucleolytic activity of telomerase. The advantage of this method is that telomerase
activity can be monitored by any of several standard means, and the signal is strong and
easy to quantify. The blocking of the 3' terminus of the primer can be accomplished in
several ways. One method is the addition of a 3'-deoxy-dNTP residue at the 3' terminus
of the primer using standard oligonucleotide synthesis techniques. This terminus has a
2' OH but not the 3' OH required for telomerase. Other means of blocking the 3'
terminus exist, for instance, a 3' dideoxy terminus, a 3'-amine terminus, and others. An
example of a primer for an hTRT nucleolytic assay is 5'-TTAGGGTTAGGGTTA(G3'H),
where the last residue denotes a 3'-deoxy-guanosine residue (Glen Research,
Sterling, VA). Numerous other variations for a suitable primer based on the disclosure
are known to those of skill in the art.

c) Primer (telomere) binding activity

Telomerase primer (telomere) binding activity is described in e.g.,
Morin, 1997, supra; Collins et al., 1995, Cell 81:677; Harrington et al., 1995, J. Biol.
Chem. 270:8893. Telomerase is believed to have two sites which bind a telomeric
DNA primer. The RT motifs associated with primer binding indicate hTRT and/or
hTRT/hTR possesses DNA primer binding activity. There are several ways of assaying
primer binding activity; however, a step common to most methods is incubation of a
labeled DNA primer with hTRT or hTRT/hTR or other TRT/TR combinations under
appropriate binding conditions. Also, most methods employ a means of separating
unbound DNA from protein-bound DNA; those methods include the following.

i) Gel-shift assays (also called electrophoretic mobility shift assays) are
those in which unbound DNA primer is separated from protein-bound DNA primer by
electrophoresis on a nondenaturing gel (Ausubel et al., supra).

ii) Matrix binding assays include several variations to the basic
technique, which involves binding the hTRT or hTRT/hTR complex to a matrix (e.g.,
nitrocellulose), either before or after incubation with the labeled primer. By binding the
hTRT to a matrix, the unbound primer can be mechanically separated from bound
primer. Residual unbound DNA can be removed by washing the membrane prior to
quantitation. Those of skill recognize there are several means of coupling proteins to
such matrices, solid supports, and membranes, including chemical, photochemical, UV
cross-linking, antibody/epitope, and non-covalent (hydrophobic, electrostatic, etc.)
interactions.

The DNA primer can be any DNA with an affinity for telomerase, such
as, for example, a telomeric DNA primer like (TTAGGG)n, where n could be 1-10 and
is typically 3-5. The 3' and 5' termini can end in any location of the repeat sequence.
The primer can also have 5' or 3' extensions of non-telomeric DNA that could facilitate
labeling or detection. The primer can also be derivatized, e.g., to facilitate detection or
isolation.

d) dNTP binding activity 

Telomerase dNTP binding activity is described in e.g., Morin, 1997,
supra; Spence et al., supra. Telomerase requires dNTPs to synthesize DNA. The
hTRT protein has a nucleotide binding activity and can be assayed for dNTP binding in
a manner similar to other nucleotide binding proteins (Kantrowitz et al., 1980, Trends
Biochem. Sci. 5:124). Typically, binding of a labeled dNTP or dNTP analog can be
monitored as is known in the art for non-telomerase RT proteins.

e) RNA (i.e., hTR) binding activity 

Telomerase RNA (i.e., hTR) binding activity is described in e.g., Morin,
1997, supra; Harrington et al., 1997, Science 275:973; Collins et al., 1995, Cell 81:677.
The RNA binding activity of a TRT protein of the invention may be assayed in a
manner similar to the DNA primer binding assay described supra, using a labeled RNA
probe. Methods for separating bound and unbound RNA and for detecting RNA are
well known in the art and can be applied to the activity assays of the invention in a
manner similar to that described for the DNA primer binding assay. The RNA can be
full length hTR, fragments of hTR or other RNAs demonstrated to have an affinity for
telomerase or hTRT. See U.S. Patent No. 5,583,016 and PCT Pub. No. 96/40868.

3) TELOMERASE MOTIFS AS TARGETS

The present invention, as noted supra, provides in addition to
recombinant hTRT with a full complement (as described supra) of activities, hTRT
polypeptides having less than the full complement of the telomerase activities of
naturally occurring telomerase or hTRT or other TRT proteins. It will be appreciated
that, in view of the disclosure herein of the RT and telomerase-specific motifs of TRT,
alteration or mutation of conserved amino acid residues, such as are found in the motif
sequences discussed supra, will result in loss-of-activity mutants useful for therapeutic,
drug screening and characterization, and other uses. For example, as described in
Example 1, deletion of motifs B through D in the RT domains of the endogenous TRT
gene in S. pombe resulted in haploid cells in which telomeres progressively shortened to
the point where hybridization of a telomere probe to telomeric repeats became almost
undetectable, indicating a loss of telomerase catalytic activity. Similarly, alterations in
the WxGxS site of motif E can affect telomerase DNA primer binding or function.
Additionally, alterations of the amino acids in the motifs A, B', and C can affect the
catalytic activity of telomerase. Mutation of the DD motif of hTRT can significantly
reduce or abolish telomerase activity (see Example 16).

C) SYNTHESIS OF hTRT AND OTHER TRT POLYPEPTIDES 

The invention provides a variety of methods for making the hTRT and
other TRT polypeptides disclosed herein. In the following sections, chemical synthesis
and recombinant expression of hTRT proteins, including fusion proteins, are described in
some detail.

1) CHEMICAL SYNTHESIS 

The invention provides hTRT polypeptides synthesized, entirely or in
part, using general chemical methods well known in the art (see e.g., Caruthers et al.,
1980, Nucleic Acids Res. Symp. Ser., 215-223; and Horn et al., 1980, Nucleic Acids Res.
Symp. Ser., 225-232). For example, peptide synthesis can be performed using various
solid-phase techniques (Roberge et al., 1995, Science 269:202), including automated
synthesis (e.g., using the Perkin Elmer ABI 431A Peptide Synthesizer in accordance
with the instructions provided by the manufacturer). When full length protein is
desired, shorter polypeptides may be fused by condensation of the amino terminus of
one molecule with the carboxyl terminus of the other molecule to form a peptide bond.

The newly synthesized peptide can be substantially purified, for
example, by preparative high performance liquid chromatography (e.g., Creighton,
Proteins, Structures and Molecular Principles, WH Freeman and Co, New York
NY [1983]). The composition of the synthetic peptides (or any other peptides or
polypeptides of the invention) may be confirmed by amino acid analysis or sequencing
(e.g., the Edman degradation procedure; Creighton, supra). Importantly, the amino
acid sequence of hTRT, or any part thereof, may be altered during direct synthesis
and/or combined using chemical methods with sequences from other proteins or
otherwise, or any part thereof or for any purpose, to produce a variant polypeptide of
the invention.

2) RECOMBINANT EXPRESSION OF hTRT AND OTHER TRT 
PROTEINS 

The present invention provides methods, reagents, vectors, and cells
useful for expression of hTRT polypeptides and nucleic acids using in vitro (cell-free),
ex vivo or in vivo (cell or organism-based) recombinant expression systems. In one
embodiment, expression of the hTRT protein, or fragment thereof, comprises inserting
the coding sequence into an appropriate expression vector (i.e., a vector that contains
the necessary elements for the transcription and translation of the inserted coding
sequence required for the expression system employed). Thus, in one aspect, the
invention provides for a polynucleotide substantially identical in sequence to an hTRT
gene coding sequence at least 25 nucleotides, and preferably for many applications 50
to 100 nucleotides or more, of the hTRT cDNAs or genes of the invention, which is
operably linked to a promoter to form a transcription unit capable of expressing an
hTRT polypeptide. Methods well known to those skilled in the art can be used to
construct the expression vectors containing an hTRT sequence and appropriate
transcriptional or translational controls provided by the present invention (see, e.g.,
Sambrook et al., supra, Ausubel et al. supra, and this disclosure).

The hTRT polypeptides provided by the invention include fusion
proteins that contain hTRT polypeptides or fragments of the hTRT protein. The fusion
proteins are typically produced by recombinant means, although they may also be made
by chemical synthesis. Fusion proteins can be useful in providing enhanced expression
of the hTRT polypeptide constructs, or in producing hTRT polypeptides having other
desirable properties, for example, comprising a label (such as an enzymatic reporter
group), binding group, or antibody epitope. An exemplary fusion protein, comprising
hTRT and enhanced green fluorescent protein (EGFP) sequences, is described in
Example 15, infra. It will be apparent to one of skill that the uses and applications
discussed in Example 15 and elsewhere herein are not limited to the particular fusion
protein, but are illustrative of the uses of various fusion constructs.

The fusion protein systems of the invention can also be used to facilitate
efficient production and isolation of hTRT proteins or peptides. For example, in some
embodiments, the non-hTRT sequence portion of the fusion protein comprises a short
peptide that can be specifically bound to an immobilized molecule such that the fusion
protein can be separated from unbound components (such as unrelated proteins in a cell
lysate). One example is a peptide sequence that is bound by a specific antibody.
Another example is a peptide comprising polyhistidine tracts, e.g., (His)6 or
histidine-tryptophan sequences that can be bound by a resin containing nickel or copper
ions (i.e., metal-chelate affinity chromatography). Other examples include Protein A
domains or fragments, which allow purification on immobilized immunoglobulin, and
the domain utilized in the FLAGS extension/affinity purification system (Immunex
Corp, Seattle WA). In some embodiments, the fusion protein includes a cleavage site
so that the hTRT or other TRT polypeptide sequence can be easily separated from the
non-hTRT peptide or protein sequence. In this case, cleavage may be chemical (e.g.,
cyanogen bromide, 2-(2-nitrophenylsulphenyl)-3-methyl-3'-bromoindolene,
hydroxylamine, or low pH) or enzymatic (e.g., Factor Xa, enterokinase). The choice of
the fusion and cleavage systems may depend, in part, on the portion (i.e., sequence) of
the hTRT polypeptide being expressed. Fusion proteins generally are described in
Ausubel et al., supra, Ch. 16, Kroll et al., 1993, DNA Cell Biol. 12:441, and the
Invitrogen 1997 Catalog (Invitrogen Inc, San Diego CA). Other exemplary fusion
proteins of the invention with epitope tags or tags and cleavage sites are provided in
Example 6, infra.

It will be appreciated by those of skill that, although the expression
systems discussed in this section are focused on expression of hTRT polypeptides, the
same or similar cells, vectors and methods may be used to express hTRT 
polynucleotides of the invention, including sense and antisense polynucleotides without 
necessarily desiring production of hTRT polypeptides. Typically, expression of a 
polypeptide requires a suitable initiation codon (e.g., methionine), open reading frame, 
and translational regulatory signals (e.g., a ribosome binding site, a termination codon) 
which may be omitted when translation of a nucleic acid sequence to produce a protein 
is not desired. 

Expression of hTRT polypeptides and polynucleotides may be carried 
out to accomplish any of several related benefits provided by the present invention. 
One illustrative benefit is expression of hTRT polypeptides that are subsequently
isolated from the cell in which they are expressed (for example for production of large 
amounts of hTRT for use as a vaccine or in screening applications to identify 
compounds that modulate telomerase activity). A second illustrative benefit is 
expression of hTRT in a cell to change the phenotype of the cell (as in gene therapy 
applications). Nonmammalian cells can be used for expression of hTRT for 
purification, while eukaryotic, especially mammalian, cells (e.g., human cells) can be
used not only for isolation and purification of hTRT but also for expression of hTRT 
when a change in phenotype in a cell is desired (e.g., to effect a change in proliferative 
capacity as in gene therapy applications). By way of illustration and not limitation, 
hTRT polypeptides having one or more telomerase activities (e.g. telomerase catalytic 
activity) can be expressed in a host cell to increase the proliferative capacity of a cell 
(e.g., immortalize a cell) and, conversely, hTRT antisense polynucleotides or inhibitory 
polypeptides typically can be expressed to reduce the proliferative capacity of a cell 
(e.g., of a telomerase positive malignant tumor cell). Numerous specific applications 
are described herein, e.g., in the discussion of uses of the reagents and methods of the 
invention for therapeutic applications, below. 

Illustrative useful expression systems (cells, regulatory elements, vectors 
and expression) of the present invention include a number of cell-free systems such as 
reticulocyte lysate and wheat germ systems using hTRT polynucleotides in accordance 
with general methods well known in the art (see, e.g., Ausubel et al. supra at Ch. 10).
In alternative embodiments, the invention provides reagents and methods for expressing
hTRT in prokaryotic or eukaryotic cells. Thus, the present invention provides nucleic
acids encoding hTRT polynucleotides, proteins, protein subsequences, or fusion
proteins that can be expressed in bacteria, fungi, plant, insect, and animal, including
human cell expression systems known in the art, including isolated cells, cell lines, cell
cultures, tissues, and whole organisms. As will be understood by those of skill, the 
hTRT polynucleotides introduced into a host cell or cell free expression system will 
usually be operably linked to appropriate expression control sequences for each host or 
cell free system. 

Useful bacterial expression systems include E. coli, bacilli (such as
Bacillus subtilis), other enterobacteriaceae (such as Salmonella, Serratia, and various
Pseudomonas species) or other bacterial hosts (e.g., Streptococcus cremoris,
Streptococcus lactis, Streptococcus thermophilus, Leuconostoc citrovorum,
Leuconostoc mesenteroides, Lactobacillus acidophilus, Lactobacillus lactis,
Bifidobacterium bifidum, Bifidobacterium breve, and Bifidobacterium longum). The
hTRT expression constructs useful in prokaryotes include recombinant bacteriophage,
plasmid or cosmid DNA expression vectors, or the like, and typically include promoter
sequences. Illustrative promoters include inducible promoters, such as the lac
promoter, the hybrid lacZ promoter of the Bluescript® phagemid [Stratagene, La Jolla
CA] or pSport1 [Gibco BRL]; phage lambda promoter systems; a tryptophan (trp)
promoter system; and ptrp-lac hybrids and the like. Bacterial expression constructs
optionally include a ribosome binding site and transcription termination signal
regulatory sequences. Illustrative examples of specific vectors useful for expression
include, for example, pTrcHis2 (Invitrogen, San Diego CA), pThioHis A, B & C, and
numerous others known in the art or that may be developed (see, e.g., Ausubel). Useful
vectors for bacteria include those that facilitate production of hTRT-fusion proteins.
Useful vectors for high level expression of fusion proteins in bacterial cells include, but
are not limited to, the multifunctional E. coli cloning and expression vectors such as
Bluescript® (Stratagene), noted above, in which the sequence encoding hTRT protein,
an hTRT fusion protein or an hTRT fragment may be ligated into the vector in-frame
with sequences for the amino-terminal Met and the subsequent 7 residues of
β-galactosidase so that a hybrid protein is produced (e.g., pIN vectors; Van Heeke and
Schuster, 1989, J. Biol. Chem., 264:5503). Vectors such as pGEX vectors (e.g., pGEX-2TK;
Pharmacia Biotech) may also be used to express foreign polypeptides, such as
hTRT protein, as fusion proteins with glutathione S-transferase (GST). Such fusion
proteins may be purified from lysed cells by adsorption to glutathione-agarose beads
followed by elution in the presence of free glutathione. Proteins made in such systems
often include enterokinase, thrombin or factor Xa protease cleavage sites so that the
cloned polypeptide of interest can be released from the GST moiety at will, as may be
useful in purification or other applications. Other examples are fusion proteins
comprising hTRT and the E. coli Maltose Binding Protein (MBP) or E. coli
thioredoxin. Illustrative examples of hTRT expression constructs useful in bacterial
cells are provided in Example 6, infra.

The invention further provides hTRT polypeptides expressed in fungal
systems, such as Dictyostelium and, preferably, yeast, such as Saccharomyces
cerevisiae, Pichia pastoris, Torulopsis holmii, Saccharomyces fragilis, Saccharomyces
lactis, Hansenula polymorpha and Candida pseudotropicalis. When hTRT is
expressed in yeast, a number of suitable vectors are available, including plasmid and
yeast artificial chromosome (YAC) vectors. The vectors typically include expression
control sequences, such as constitutive or inducible promoters (e.g., alpha
factor, alcohol oxidase, PGH, and 3-phosphoglycerate kinase or other glycolytic
enzymes), and an origin of replication, termination sequences and the like, as desired.
Suitable vectors for use in Pichia include pPICZ, His6/pPICZB, pPICZalpha,
pPIC3.5K, pPIC9K, pAO815, pGAP2A, B & C, pGAP2alpha A, B, and C (Invitrogen,
San Diego, CA) and numerous others known in the art or to be developed. In one
embodiment, the vector His6/pPICZB (Invitrogen, San Diego, CA) is used to express a
His6-hTRT fusion protein in the yeast Pichia pastoris. An example of a vector useful in
Saccharomyces is pYES2 (Invitrogen, San Diego, CA). Illustrative examples of hTRT
expression constructs useful in yeast are provided in Example 6, infra.

The hTRT polypeptides of the invention may also be expressed in plant
cell systems transfected with plant or plant virus expression vectors (e.g., cauliflower
mosaic virus, CaMV; tobacco mosaic virus, TMV) or transformed with bacterial
expression vectors (e.g., Ti or pBR322 plasmid). In cases where plant virus expression
vectors are used, the expression of an hTRT-encoding sequence may be driven by any
of a number of promoters. For example, viral promoters such as the 35S and 19S
promoters of CaMV (Brisson et al., 1984, Nature 310:511-514) may be used alone or in
combination with the omega leader sequence from TMV (Takamatsu et al., 1987,
EMBO J., 6:307-311). Alternatively, plant promoters such as that from the small
subunit gene of RUBISCO (Coruzzi et al., 1984, EMBO J., 3:1671-1680; Broglie et al.,
1984, Science 224:838-843) or heat shock promoters (Winter and Sinibaldi, 1991,
Results Probl. Cell Differ., 17:85), or storage protein gene promoters may be used.
These constructs can be introduced into plant cells by direct DNA transformation or
pathogen-mediated transfection (for reviews of such techniques, see Hobbs or Murry,
1992, in McGraw Hill Yearbook of Science and Technology, McGraw Hill, New
York NY, pp. 191-196 [1992]; or Weissbach and Weissbach, 1988, Methods for
Plant Molecular Biology, Academic Press, New York NY, pp. 421-463).

Another expression system provided by the invention for expression of
hTRT protein is an insect system. A preferred system uses a baculovirus polyhedrin
promoter. In one such system, Autographa californica nuclear polyhedrosis virus
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or
in Trichoplusia larvae. The sequence encoding the gene of interest may be cloned into
a nonessential region of the virus, such as the polyhedrin gene, and placed under control
of the polyhedrin promoter. Successful insertion of the sequence, e.g., encoding the
hTRT protein, will render the polyhedrin gene inactive and produce recombinant virus
lacking coat protein. The recombinant viruses are then used to infect S. frugiperda
cells or Trichoplusia larvae, in which the hTRT sequence is then expressed (see, for
general methods, Smith et al., J. Virol., 46:584 [1983]; Engelhard et al., Proc. Natl.
Acad. Sci. 91:3224-7 [1994]). Useful vectors for baculovirus expression include
pBlueBacHis2 A, B & C, pBlueBac4.5, pMelBacB and numerous others known in the
art or to be developed. Illustrative examples of hTRT expression constructs useful in
insect cells are provided in Example 6, infra.

The present invention also provides expression systems in mammals and
mammalian cells. As noted supra, hTRT polynucleotides may be expressed in
mammalian cells (e.g., human cells) for production of significant quantities of hTRT
polypeptides (e.g., for purification) or to change the phenotype of a target cell (e.g., for
purposes of gene therapy, cell immortalization, or other). In the latter case, the hTRT
polynucleotide expressed may or may not encode a polypeptide with a telomerase
catalytic activity. That is, expression may be of a sense or antisense polynucleotide, an
inhibitory or stimulatory polypeptide, a polypeptide with zero, one or more telomerase
activities, and other combinations and variants disclosed herein or apparent to one of
skill upon review of this disclosure.

Suitable mammalian host tissue culture cells for expressing the nucleic
acids of the invention include any normal mortal or normal or abnormal immortal
animal or human cell, including: monkey kidney CV1 line transformed by SV40
(COS-7, ATCC CRL 1651); human embryonic kidney line (293; Graham et al., J. Gen.
Virol. 36:59 (1977)); baby hamster kidney cells (BHK, ATCC CCL 10); CHO (ATCC
CCL 61 and CRL 9618); mouse Sertoli cells (TM4, Mather, Biol. Reprod. 23:243-251
(1980)); monkey kidney cells (CV1 ATCC CCL 70); African green monkey kidney
cells (VERO-76, ATCC CRL 1587); human cervical carcinoma cells (HeLa, ATCC
CCL 2); canine kidney cells (MDCK, ATCC CCL 34); buffalo rat liver cells (BRL 3A,
ATCC CRL 1442); human lung cells (WI38, ATCC CCL 75); human liver cells (Hep
G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC CCL51); TRI cells
(Mather, et al., Annals N.Y. Acad. Sci. 383:44-46 (1982)); MDCK cells (ATCC CCL 34
and CRL 6253); HEK 293 cells (ATCC CRL 1573); and WI-38 cells (ATCC CCL 75;
ATCC: American Type Culture Collection, Rockville, MD). The use of mammalian
tissue cell culture to express polypeptides is discussed generally in Winnacker, From
Genes to Clones (VCH Publishers, N.Y., N.Y., 1987).

For mammalian host cells, viral-based and nonviral expression systems
are provided. Nonviral vectors and systems include plasmids and episomal vectors,
typically with an expression cassette for expressing a protein or RNA, and human
artificial chromosomes (see, e.g., Harrington et al., 1997, Nat. Genet. 15:345). For
example, nonviral vectors useful for expression of hTRT polynucleotides and
polypeptides in mammalian (e.g., human) cells include pcDNA3.1/His, pEBVHis A, B
& C (Invitrogen, San Diego CA), MPSV vectors, others described in the Invitrogen
1997 Catalog (Invitrogen Inc, San Diego CA), which is incorporated in its entirety
herein, and numerous others known in the art for other proteins. Illustrative examples
of hTRT expression constructs useful in mammalian cells are provided in Example 6,
infra.

Useful viral vectors include vectors based on retroviruses, adenoviruses,
adenoassociated viruses, herpes viruses, vectors based on SV40, papilloma virus, HBP
Epstein Barr virus, vaccinia virus vectors and Semliki Forest virus (SFV). SFV and
vaccinia vectors are discussed generally in Ausubel et al., supra, Ch 16. These vectors
are often made up of two components, a modified viral genome and a coat structure
surrounding it (see generally Smith, 1995, Annu. Rev. Microbiol. 49:807), although
sometimes viral vectors are introduced in naked form or coated with proteins other than
viral proteins. However, the viral nucleic acid in a vector may be changed in many
ways, for example, when designed for gene therapy. The goals of these changes are to
disable growth of the virus in target cells while maintaining its ability to grow in vector
form in available packaging or helper cells, to provide space within the viral genome
for insertion of exogenous DNA sequences, and to incorporate new sequences that
encode and enable appropriate expression of the gene of interest. Thus, vector nucleic
acids generally comprise two components: essential cis-acting viral sequences for
replication and packaging in a helper line and the transcription unit for the exogenous
gene. Other viral functions are expressed in trans in a specific packaging or helper cell
line. Adenoviral vectors (e.g., for use in human gene therapy) are described in, e.g.,
Rosenfeld et al., 1992, Cell 68:143; PCT publications WO 94/12650; 94/12649; and
94/12629. In cases where an adenovirus is used as an expression vector, a sequence
encoding hTRT may be ligated into an adenovirus transcription/translation complex
consisting of the late promoter and tripartite leader sequence. Insertion in a
nonessential E1 or E3 region of the viral genome will result in a viable virus capable of
expressing in infected host cells (Logan and Shenk, 1984, Proc. Natl. Acad. Sci.,
81:3655). Replication-defective retroviral vectors harboring a therapeutic
polynucleotide sequence as part of the retroviral genome are described in, e.g., Miller et
al., 1990, Mol. Cell. Biol. 10:4239; Kolberg, 1992, J. NIH Res. 4:43; and Cornetta et
al., 1991, Hum. Gene Ther. 2:215.

In mammalian cell systems, promoters from mammalian genes or from
mammalian viruses are often appropriate. Suitable promoters may be constitutive, cell
type-specific, stage-specific, and/or modulatable or regulatable (e.g., by hormones such
as glucocorticoids). Useful promoters include, but are not limited to, the
metallothionein promoter, the constitutive adenovirus major late promoter, the
dexamethasone-inducible MMTV promoter, the SV40 promoter, the MRP polIII
promoter, the constitutive MPSV promoter, the tetracycline-inducible CMV promoter
(such as the human immediate-early CMV promoter), the constitutive CMV promoter,
and promoter-enhancer combinations known in the art.

Other regulatory elements may also be required or desired for efficient
expression of an hTRT polynucleotide and/or translation of a sequence encoding hTRT
proteins. For translation, these elements typically include an ATG initiation codon and
adjacent ribosome binding site or other sequences. For sequences encoding the hTRT
protein, provided its initiation codon and upstream promoter sequences are inserted into
an expression vector, no additional translational or other control signals may be needed.
However, in cases where only coding sequence, or a portion thereof, is inserted,
exogenous transcriptional and/or translational control signals (e.g., the promoter,
ribosome-binding site, and ATG initiation codon) must often be provided.
Furthermore, the initiation codon must typically be in the correct reading frame to
ensure translation of the desired protein. Exogenous transcriptional elements and
initiation codons can be of various origins, both natural and synthetic. In addition, the
efficiency of expression may be enhanced by the inclusion of enhancers appropriate to
the cell system in use (Scharf et al., 1994, Results Probl. Cell Differ. 20:125; and
Bittner et al., 1987, Meth. Enzymol., 153:516). For example, the SV40 enhancer or
CMV enhancer may be used to increase expression in mammalian host cells.
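
The reading-frame requirement described above can be illustrated with a short check. This is a minimal sketch under stated assumptions, not a method taken from this disclosure: the function name `check_coding_insert` and the choice of three checks (leading ATG, length divisible by three, no internal stop codon) are illustrative only.

```python
# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def check_coding_insert(seq: str) -> list[str]:
    """Return a list of problems that would prevent full-length translation.

    Hypothetical helper: it only inspects the coding insert itself and says
    nothing about promoters, ribosome binding sites or vector context.
    """
    # Normalize: drop whitespace so codon-spaced input is accepted.
    seq = "".join(seq.upper().split())
    problems = []
    if not seq.startswith("ATG"):
        problems.append("no ATG initiation codon at position 1")
    if len(seq) % 3 != 0:
        problems.append("length is not a multiple of 3")
    # Split into complete codons, reading from the first base.
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    # A stop codon anywhere before the final codon truncates the product.
    for index, codon in enumerate(codons[:-1]):
        if codon in STOP_CODONS:
            problems.append(f"premature stop codon {codon} at codon {index + 1}")
    return problems
```

An empty list means the insert passed all three checks; any other result names the reason a construct built from only a coding fragment may need an exogenous initiation codon or a frame adjustment.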

Expression of hTRT gene products can also be effected (increased) by
activation of an hTRT promoter or enhancer in a cell such as a human cell, e.g., a
telomerase-negative cell line. Activation can be carried out in a variety of ways,
including administration of an exogenous promoter activating agent, or inhibition of a
cellular component that suppresses expression of the hTRT gene. It will be appreciated
that, conversely, inhibition of promoter function, as described infra, will reduce hTRT
gene expression.

The invention provides inducible and repressible expression of hTRT
polypeptides using such systems as the Ecdysone-Inducible Expression System
(Invitrogen), and the Tet-On and Tet-Off tetracycline regulated systems from Clontech.
The ecdysone-inducible expression system uses the steroid hormone ecdysone analog,
muristerone A, to activate expression of a recombinant protein via a heterodimeric
nuclear receptor (No et al., 1996, Proc. Natl. Acad. Sci. USA 93:3346). In one
embodiment of the invention, hTRT is cloned in the pIND vector (Clontech), which
contains five modified ecdysone response elements (E/GREs) upstream of a minimal
heat shock promoter and the multiple cloning site. The construct is then transfected in
cell lines stably expressing the ecdysone receptor. After transfection, cells are treated
with muristerone A to induce intracellular expression from pIND. In another
embodiment of the invention, hTRT polypeptide is expressed using the Tet-On and
Tet-Off expression systems (Clontech) to provide regulated, high-level gene expression
(Gossen et al., 1992, Proc. Natl. Acad. Sci. USA 89:5547; Gossen et al., 1995, Science
268:1766).

The hTRT vectors of the invention may be introduced into a cell, tissue,
organ, patient or animal by a variety of methods. The nucleic acid expression vectors
(typically dsDNA) of the invention can be transferred into the chosen host cell by
well-known methods such as calcium chloride transformation (for bacterial systems),
electroporation, calcium phosphate treatment, liposome-mediated transformation,
injection and microinjection, ballistic methods, virosomes, immunoliposomes,
polycation:nucleic acid conjugates, naked DNA, artificial virions, fusion to the herpes
virus structural protein VP22 (Elliot and O'Hare, Cell 88:223), agent-enhanced uptake
of DNA, and ex vivo transduction. Useful liposome-mediated DNA transfer methods
are described in U.S. Patent Nos. 5,049,386; 4,946,787; and 4,897,355; PCT
publications WO 91/17424, WO 91/16024; Wang and Huang, 1987, Biochem. Biophys.
Res. Commun. 147:980; Wang and Huang, 1989, Biochemistry 28:9508; Litzinger and
Huang, 1992, Biochem. Biophys. Acta 1113:201; Gao and Huang, 1991, Biochem.
Biophys. Res. Commun. 179:280. Immunoliposomes have been described as carriers
of exogenous polynucleotides (Wang and Huang, 1987, Proc. Natl. Acad. Sci. U.S.A.
84:7851; Trubetskoy et al., 1992, Biochem. Biophys. Acta 1131:311) and may have
improved cell type specificity as compared to liposomes by virtue of the inclusion of
specific antibodies which presumably bind to surface antigens on specific cell types.
Behr et al., 1989, Proc. Natl. Acad. Sci. U.S.A. 86:6982 report using lipopolyamine as a
reagent to mediate transfection itself, without the necessity of any additional
phospholipid to form liposomes. Suitable delivery methods will be selected by
practitioners in view of acceptable practices and regulatory requirements (e.g., for gene
therapy or production of cell lines for expression of recombinant proteins). It will be
appreciated that the delivery methods listed above may be used for transfer of nucleic
acids into cells for purposes of gene therapy, transfer into tissue culture cells, and the
like.

For long-term, high-yield production of recombinant proteins, stable 
expression will often be desired. For example, cell lines which stably express hTRT 
can be prepared using expression vectors of the invention which contain viral origins of 
replication or endogenous expression elements and a selectable marker gene. 
Following the introduction of the vector, cells may be allowed to grow for 1-2 days in
enriched media before they are switched to selective media. The purpose of the
selectable marker is to confer resistance to selection, and its presence allows growth of
cells which successfully express the introduced sequences in selective media.
Resistant, stably transfected cells can be proliferated using tissue culture techniques
appropriate to the cell type. An amplification step, e.g., by administration of
methotrexate to cells transfected with a DHFR gene according to methods well known
in the art, can be included.

In addition, a host cell strain may be chosen for its ability to modulate
the expression of the inserted sequences or to process the expressed protein in the
desired fashion. Such modifications of the polypeptide include, but are not limited to,
acetylation, carboxylation, phosphorylation, lipidation and acylation. Post-translational
processing may also be important for correct insertion, folding and/or function.
Different host cells have cellular machinery and characteristic mechanisms specific for
each cell for such post-translational activities and so a particular cell may be chosen to
ensure the correct modification and processing of the introduced, foreign protein.

As noted supra, when expressing an hTRT protein (including variants)
in cells or organisms it is sometimes desirable to use an hTRT protein-encoding
polynucleotide that employs a codon distribution other than that found in a naturally
occurring hTRT gene. hTRT protein-encoding polynucleotides with alternative codons
throughout, or at specific sites, in the coding sequence are used to optimize (e.g.,
increase) expression of the hTRT protein in cells, especially non-human cells (e.g.,
bacterial, plant, fungal, and non-human animal cells) which have different preferential
codon usage than human cells. Codon changes may also be used to facilitate
manipulation of the hTRT polynucleotide (e.g., by engineering useful tags or restriction
sites into the coding sequence), and for other reasons. When the goal is to optimize
expression (e.g., by increasing translational efficiency), tables of preferred codon usage,
which are publicly available and are well known to those of skill, are used to design a
suitable polynucleotide by "reverse translation" of the desired (e.g., hTRT) amino acid
sequence. Alternatively, preferred codon usage can be determined for a particular
organism (e.g., Pichia pastoris) or class of genes (e.g., highly expressed genes of a
particular organism) by comparison of published gene sequences for the target
organism or gene class.
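
Both routes described above, reverse translation from a table of preferred codons and derivation of such a table from known gene sequences, can be sketched briefly. The sketch below is illustrative only. It assumes the standard genetic code; `PREFERRED_CODON` is a hypothetical one-codon-per-residue table filled in with the codons that appear in the E. coli sequence of Table 9A, infra, and the function names are not taken from this disclosure.

```python
from collections import Counter, defaultdict

# Standard genetic code in the conventional TCAG ordering ("*" marks stops).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TO_AA = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumed preference table: the single codon used per residue in Table 9A.
PREFERRED_CODON = {
    "A": "GCG", "C": "TGC", "D": "GAT", "E": "GAA", "F": "TTT",
    "G": "GGC", "H": "CAT", "I": "ATT", "K": "AAA", "L": "CTG",
    "M": "ATG", "N": "AAC", "P": "CCG", "Q": "CAG", "R": "CGC",
    "S": "AGC", "T": "ACC", "V": "GTG", "W": "TGG", "Y": "TAT",
}


def reverse_translate(protein, table=PREFERRED_CODON):
    """Build a coding sequence by picking one codon per amino acid."""
    return "".join(table[residue] for residue in protein.upper())


def preferred_codons_from_genes(genes):
    """Derive a preference table by counting codons in in-frame gene sequences."""
    counts = defaultdict(Counter)
    for gene in genes:
        gene = "".join(gene.upper().split())
        for i in range(0, len(gene) - 2, 3):
            codon = gene[i:i + 3]
            residue = CODON_TO_AA.get(codon)
            if residue is not None and residue != "*":
                counts[residue][codon] += 1
    # Keep the most frequent codon observed for each residue.
    return {residue: c.most_common(1)[0][0] for residue, c in counts.items()}
```

For example, `reverse_translate("MPRAPRCRAVRSLLRSHY")` reproduces the first row of Table 9A. A table returned by `preferred_codons_from_genes` covers only the residues present in the genes supplied, so a realistic input set would need to be large enough to include all twenty amino acids.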

Illustrative hTRT-encoding polynucleotide sequences are provided in
Table 9 (A-E), infra. All of the sequences in Table 9 are in the 5'->3' direction. Table 9A
shows an hTRT protein encoding polynucleotide that uses a codon distribution preferentially
employed in the bacterium E. coli. Table 9B shows a second polynucleotide sequence
particularly useful for expression in E. coli (and other enteric bacteria) using codons
preferentially used in highly expressed genes in enteric bacteria. Table 9C shows an
hTRT protein encoding polynucleotide that uses a codon distribution preferentially
employed in yeast (i.e., S. cerevisiae). Table 9D shows an hTRT protein encoding
polynucleotide that uses a codon distribution preferentially used in highly expressed
genes in yeast. Table 9E shows an hTRT protein encoding polynucleotide that uses a
"generic" codon distribution that should be efficiently expressed in both bacteria (e.g.,
E. coli) and yeast (e.g., S. pombe, S. cerevisiae, P. pastoris) and some insect (e.g., S.
frugiperda) cells. Such "generic" polynucleotide sequences (optimized for more than
one organism) are useful for, for example, comparative studies, screening in different
organisms of hTRT binding or modulatory agents, creation of shuttle vectors, and other
uses. In this "generic" sequence, the codon TCT (serine) may not be optimal for
expression in Drosophila cells. Therefore, in an alternative embodiment the sequence
in Table 9E is modified to replace TCT with TCC for efficient expression in
Drosophila as well as bacteria and yeast.
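
The TCT-to-TCC substitution described above has to be applied codon by codon, because a plain text search would also match TCT triplets that straddle two codons. A minimal sketch, assuming the input is an in-frame coding sequence; the function name `swap_codon` is illustrative and not part of this disclosure.

```python
def swap_codon(seq, old="TCT", new="TCC"):
    """Replace every in-frame occurrence of one codon with another."""
    # Accept codon-spaced or continuous input.
    seq = "".join(seq.upper().split())
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    # Walk the sequence in reading frame so out-of-frame matches are ignored.
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return "".join(new if codon == old else codon for codon in codons)
```

Because TCT and TCC both encode serine, the encoded amino acid sequence is unchanged by this substitution.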

TABLE 9 

hTRT-ENCODING POLYNUCLEOTIDE SEQUENCES EMPLOYING 
ALTERNATIVE CODON DISTRIBUTIONS



Table 9A
E. coli (all genes)

ATG CCG CGC GCG CCG CGC TGC CGC GCG GTG CGC AGC CTG CTG CGC AGC CAT TAT 
CGC GAA GTG CTG CCG CTG GCG ACC TTT GTG CGC CGC CTG GGC CCG CAG GGC TGG 
CGC CTG GTG CAG CGC GGC GAT CCG GCG GCG TTT CGC GCG CTG GTG GCG CAG TGC
CTG GTG TGC GTG CCG TGG GAT GCG CGC CCG CCG CCG GCG GCG CCG AGC TTT CGC 
CAG GTG AGC TGC CTG AAA GAA CTG GTG GCG CGC GTG CTG CAG CGC CTG TGC GAA 
CGC GGC GCG AAA AAC GTG CTG GCG TTT GGC TTT GCG CTG CTG GAT GGC GCG CGC 
GGC GGC CCG CCG GAA GCG TTT ACC ACC AGC GTG CGC AGC TAT CTG CCG AAC ACC 
GTG ACC GAT GCG CTG CGC GGC AGC GGC GCG TGG GGC CTG CTG CTG CGC CGC GTG
GGC GAT GAT GTG CTG GTG CAT CTG CTG GCG CGC TGC GCG CTG TTT GTG CTG GTG 
GCG CCG AGC TGC GCG TAT CAG GTG TGC GGC CCG CCG CTG TAT CAG CTG GGC GCG 
GCG ACC CAG GCG CGC CCG CCG CCG CAT GCG AGC GGC CCG CGC CGC CGC CTG GGC 
TGC GAA CGC GCG TGG AAC CAT AGC GTG CGC GAA GCG GGC GTG CCG CTG GGC CTG 
CCG GCG CCG GGC GCG CGC CGC CGC GGC GGC AGC GCG AGC CGC AGC CTG CCG CTG
CCG AAA CGC CCG CGC CGC GGC GCG GCG CCG GAA CCG GAA CGC ACC CCG GTG GGC 
CAG GGC AGC TGG GCG CAT CCG GGC CGC ACC CGC GGC CCG AGC GAT CGC GGC TTT 
TGC GTG GTG AGC CCG GCG CGC CCG GCG GAA GAA GCG ACC AGC CTG GAA GGC GCG 
CTG AGC GGC ACC CGC CAT AGC CAT CCG AGC GTG GGC CGC CAG CAT CAT GCG GGC 
CCG CCG AGC ACC AGC CGC CCG CCG CGC CCG TGG GAT ACC CCG TGC CCG CCG GTG
TAT GCG GAA ACC AAA CAT TTT CTG TAT AGC AGC GGC GAT AAA GAA CAG CTG CGC 
CCG AGC TTT CTG CTG AGC AGC CTG CGC CCG AGC CTG ACC GGC GCG CGC CGC CTG 
GTG GAA ACC ATT TTT CTG GGC AGC CGC CCG TGG ATG CCG GGC ACC CCG CGC CGC 
CTG CCG CGC CTG CCG CAG CGC TAT TGG CAG ATG CGC CCG CTG TTT CTG GAA CTG 
CTG GGC AAC CAT GCG CAG TGC CCG TAT GGC GTG CTG CTG AAA ACC CAT TGC CCG
CTG CGC GCG GCG GTG ACC CCG GCG GCG GGC GTG TGC GCG CGC GAA AAA CCG CAG 
GGC AGC GTG GCG GCG CCG GAA GAA GAA GAT ACC GAT CCG CGC CGC CTG GTG CAG 
CTG CTG CGC CAG CAT AGC AGC CCG TGG CAG GTG TAT GGC TTT GTG CGC GCG TGC 
CTG CGC CGC CTG GTG CCG CCG GGC CTG TGG GGC AGC CGC CAT AAC GAA CGC CGC 
TTT CTG CGC AAC ACC AAA AAA TTT ATT AGC CTG GGC AAA CAT GCG AAA CTG AGC
CTG CAG GAA CTG ACC TGG AAA ATG AGC GTG CGC GAT TGC GCG TGG CTG CGC CGC 
AGC CCG GGC GTG GGC TGC GTG CCG GCG GCG GAA CAT CGC CTG CGC GAA GAA ATT 
CTG GCG AAA TTT CTG CAT TGG CTG ATG AGC GTG TAT GTG GTG GAA CTG CTG CGC 
AGC TTT TTT TAT GTG ACC GAA ACC ACC TTT CAG AAA AAC CGC CTG TTT TTT TAT 
CGC AAA AGC GTG TGG AGC AAA CTG CAG AGC ATT GGC ATT CGC CAG CAT CTG AAA
CGC GTG CAG CTG CGC GAA CTG AGC GAA GCG GAA GTG CGC CAG CAT CGC GAA GCG 
CGC CCG GCG CTG CTG ACC AGC CGC CTG CGC TTT ATT CCG AAA CCG GAT GGC CTG 
CGC CCG ATT GTG AAC ATG GAT TAT GTG GTG GGC GCG CGC ACC TTT CGC CGC GAA
AAA CGC GCG GAA CGC CTG ACC AGC CGC GTG AAA GCG CTG TTT AGC GTG CTG AAC
TAT GAA CGC GCG CGC CGC CCG GGC CTG CTG GGC GCG AGC GTG CTG GGC CTG GAT
GAT ATT CAT CGC GCG TGG CGC ACC TTT GTG CTG CGC GTG CGC GCG CAG GAT CCG
CCG CCG GAA CTG TAT TTT GTG AAA GTG GAT GTG ACC GGC GCG TAT GAT ACC ATT
CCG CAG GAT CGC CTG ACC GAA GTG ATT GCG AGC ATT ATT AAA CCG CAG AAC ACC
TAT TGC GTG CGC CGC TAT GCG GTG GTG CAG AAA GCG GCG CAT GGC CAT GTG CGC
AAA GCG TTT AAA AGC CAT GTG AGC ACC CTG ACC GAT CTG CAG CCG TAT ATG CGC
CAG TTT GTG GCG CAT CTG CAG GAA ACC AGC CCG CTG CGC GAT GCG GTG GTG ATT
GAA CAG AGC AGC AGC CTG AAC GAA GCG AGC AGC GGC CTG TTT GAT GTG TTT CTG
CGC TTT ATG TGC CAT CAT GCG GTG CGC ATT CGC GGC AAA AGC TAT GTG CAG TGC
CAG GGC ATT CCG CAG GGC AGC ATT CTG AGC ACC CTG CTG TGC AGC CTG TGC TAT
GGC GAT ATG GAA AAC AAA CTG TTT GCG GGC ATT CGC CGC GAT GGC CTG CTG CTG
CGC CTG GTG GAT GAT TTT CTG CTG GTG ACC CCG CAT CTG ACC CAT GCG AAA ACC
TTT CTG CGC ACC CTG GTG CGC GGC GTG CCG GAA TAT GGC TGC GTG GTG AAC CTG
CGC AAA ACC GTG GTG AAC TTT CCG GTG GAA GAT GAA GCG CTG GGC GGC ACC GCG
TTT GTG CAG ATG CCG GCG CAT GGC CTG TTT CCG TGG TGC GGC CTG CTG CTG GAT
ACC CGC ACC CTG GAA GTG CAG AGC GAT TAT AGC AGC TAT GCG CGC ACC AGC ATT
CGC GCG AGC CTG ACC TTT AAC CGC GGC TTT AAA GCG GGC CGC AAC ATG CGC CGC
AAA CTG TTT GGC GTG CTG CGC CTG AAA TGC CAT AGC CTG TTT CTG GAT CTG CAG
GTG AAC AGC CTG CAG ACC GTG TGC ACC AAC ATT TAT AAA ATT CTG CTG CTG CAG
GCG TAT CGC TTT CAT GCG TGC GTG CTG CAG CTG CCG TTT CAT CAG CAG GTG TGG
AAA AAC CCG ACC TTT TTT CTG CGC GTG ATT AGC GAT ACC GCG AGC CTG TGC TAT
AGC ATT CTG AAA GCG AAA AAC GCG GGC ATG AGC CTG GGC GCG AAA GGC GCG GCG
GGC CCG CTG CCG AGC GAA GCG GTG CAG TGG CTG TGC CAT CAG GCG TTT CTG CTG
AAA CTG ACC CGC CAT CGC GTG ACC TAT GTG CCG CTG CTG GGC AGC CTG CGC ACC
GCG CAG ACC CAG CTG AGC CGC AAA CTG CCG GGC ACC ACC CTG ACC GCG CTG GAA
GCG GCG GCG AAC CCG GCG CTG CCG AGC GAT TTT AAA ACC ATT CTG GAT





Table 9B 

Enteric Bacteria (High Expressing Genes) 

1 ATGCCGCGTG CTCCGCGTTG CCGTGCTGTT CGTTCCCTGC TGCGTTCCCA
51 CTACCGTGAA GTTCTGCCGC TGGCTACCTT CGTTCGTCGT CTGGGTCCGC
101 AGGGTTGGCG TCTGGTTCAG CGTGGTGACC CGGCTGCTTT CCGTGCTCTG
151 GTTGCTCAGT GCCTGGTTTG CGTTCCGTGG GACGCTCGTC CGCCGCCGGC
201 TGCTCCGTCC TTCCGTCAGG TTTCCTGCCT GAAAGAACTG GTTGCTCGTG
251 TTCTGCAGCG TCTGTGCGAA CGTGGTGCTA AAAACGTTCT GGCTTTCGGT
301 TTCGCTCTGC TGGACGGTGC TCGTGGTGGT CCGCCGGAAG CTTTCACCAC
351 CTCCGTTCGT TCCTACCTGC CGAACACCGT TACCGACGCT CTGCGTGGTT
401 CCGGTGCTTG GGGTCTGCTG CTGCGTCGTG TTGGTGACGA CGTTCTGGTT
451 CACCTGCTGG CTCGTTGCGC TCTGTTCGTT CTGGTTGCTC CGTCCTGCGC
501 TTACCAGGTT TGCGGTCCGC CGCTGTACCA GCTGGGTGCT GCTACCCAGG
551 CTCGTCCGCC GCCGCACGCT TCCGGTCCGC GTCGTCGTCT GGGTTGCGAA
601 CGTGCTTGGA ACCACTCCGT TCGTGAAGCT GGTGTTCCGC TGGGTCTGCC
651 GGCTCCGGGT GCTCGTCGTC GTGGTGGTTC CGCTTCCCGT TCCCTGCCGC
701 TGCCGAAACG TCCGCGTCGT GGTGCTGCTC CGGAACCGGA ACGTACCCCG
751 GTTGGTCAGG GTTCCTGGGC TCACCCGGGT CGTACCCGTG GTCCGTCCGA
801 CCGTGGTTTC TGCGTTGTTT CCCCGGCTCG TCCGGCTGAA GAAGCTACCT
851 CCCTGGAAGG TGCTCTGTCC GGTACCCGTC ACTCCCACCC GTCCGTTGGT
901 CGTCAGCACC ACGCTGGTCC GCCGTCCACC TCCCGTCCGC CGCGTCCGTG
951 GGACACCCCG TGCCCGCCGG TTTACGCTGA AACCAAACAC TTCCTGTACT
1001 CCTCCGGTGA CAAAGAACAG CTGCGTCCGT CCTTCCTGCT GTCCTCCCTG
1051 CGTCCGTCCC TGACCGGTGC TCGTCGTCTG GTTGAAACCA TCTTCCTGGG
1101 TTCCCGTCCG TGGATGCCGG GTACCCCGCG TCGTCTGCCG CGTCTGCCGC
1151 AGCGTTACTG GCAGATGCGT CCGCTGTTCC TGGAACTGCT GGGTAACCAC
1201 GCTCAGTGCC CGTACGGTGT TCTGCTGAAA ACCCACTGCC CGCTGCGTGC
1251 TGCTGTTACC CCGGCTGCTG GTGTTTGCGC TCGTGAAAAA CCGCAGGGTT
1301 CCGTTGCTGC TCCGGAAGAA GAAGACACCG ACCCGCGTCG TCTGGTTCAG
1351 CTGCTGCGTC AGCACTCCTC CCCGTGGCAG GTTTACGGTT TCGTTCGTGC
1401 TTGCCTGCGT CGTCTGGTTC CGCCGGGTCT GTGGGGTTCC CGTCACAACG
1451 AACGTCGTTT CCTGCGTAAC ACCAAAAAAT TCATCTCCCT GGGTAAACAC
1501 GCTAAACTGT CCCTGCAGGA ACTGACCTGG AAAATGTCCG TTCGTGACTG
1551 CGCTTGGCTG CGTCGTTCCC CGGGTGTTGG TTGCGTTCCG GCTGCTGAAC
1601 ACCGTCTGCG TGAAGAAATC CTGGCTAAAT TCCTGCACTG GCTGATGTCC
1651 GTTTACGTTG TTGAACTGCT GCGTTCCTTC TTCTACGTTA CCGAAACCAC
1701 CTTCCAGAAA AACCGTCTGT TCTTCTACCG TAAATCCGTT TGGTCCAAAC
1751 TGCAGTCCAT CGGTATCCGT CAGCACCTGA AACGTGTTCA GCTGCGTGAA
1801 CTGTCCGAAG CTGAAGTTCG TCAGCACCGT GAAGCTCGTC CGGCTCTGCT
1851 GACCTCCCGT CTGCGTTTCA TCCCGAAACC GGACGGTCTG CGTCCGATCG
1901 TTAACATGGA CTACGTTGTT GGTGCTCGTA CCTTCCGTCG TGAAAAACGT
1951 GCTGAACGTC TGACCTCCCG TGTTAAAGCT CTGTTCTCCG TTCTGAACTA
2001 CGAACGTGCT CGTCGTCCGG GTCTGCTGGG TGCTTCCGTT CTGGGTCTGG
2051 ACGACATCCA CCGTGCTTGG CGTACCTTCG TTCTGCGTGT TCGTGCTCAG
2101 GACCCGCCGC CGGAACTGTA CTTCGTTAAA GTTGACGTTA CCGGTGCTTA
2151 CGACACCATC CCGCAGGACC GTCTGACCGA AGTTATCGCT TCCATCATCA
2201 AACCGCAGAA CACCTACTGC GTTCGTCGTT ACGCTGTTGT TCAGAAAGCT
2251 GCTCACGGTC ACGTTCGTAA AGCTTTCAAA TCCCACGTTT CCACCCTGAC
2301 CGACCTGCAG CCGTACATGC GTCAGTTCGT TGCTCACCTG CAGGAAACCT
2351 CCCCGCTGCG TGACGCTGTT GTTATCGAAC AGTCCTCCTC CCTGAACGAA
2401 GCTTCCTCCG GTCTGTTCGA CGTTTTCCTG CGTTTCATGT GCCACCACGC
2451 TGTTCGTATC CGTGGTAAAT CCTACGTTCA GTGCCAGGGT ATCCCGCAGG
2501 GTTCCATCCT GTCCACCCTG CTGTGCTCCC TGTGCTACGG TGACATGGAA
2551 AACAAACTGT TCGCTGGTAT CCGTCGTGAC GGTCTGCTGC TGCGTCTGGT
2601 TGACGACTTC CTGCTGGTTA CCCCGCACCT GACCCACGCT AAAACCTTCC
2651 TGCGTACCCT GGTTCGTGGT GTTCCGGAAT ACGGTTGCGT TGTTAACCTG
2701 CGTAAAACCG TTGTTAACTT CCCGGTTGAA GACGAAGCTC TGGGTGGTAC
2751 CGCTTTCGTT CAGATGCCGG CTCACGGTCT GTTCCCGTGG TGCGGTCTGC
2801 TGCTGGACAC CCGTACCCTG GAAGTTCAGT CCGACTACTC CTCCTACGCT
2851 CGTACCTCCA TCCGTGCTTC CCTGACCTTC AACCGTGGTT TCAAAGCTGG
2901 TCGTAACATG CGTCGTAAAC TGTTCGGTGT TCTGCGTCTG AAATGCCACT
2951 CCCTGTTCCT GGACCTGCAG GTTAACTCCC TGCAGACCGT TTGCACCAAC
3001 ATCTACAAAA TCCTGCTGCT GCAGGCTTAC CGTTTCCACG CTTGCGTTCT
3051 GCAGCTGCCG TTCCACCAGC AGGTTTGGAA AAACCCGACC TTCTTCCTGC
3101 GTGTTATCTC CGACACCGCT TCCCTGTGCT ACTCCATCCT GAAAGCTAAA
3151 AACGCTGGTA TGTCCCTGGG TGCTAAAGGT GCTGCTGGTC CGCTGCCGTC
3201 CGAAGCTGTT CAGTGGCTGT GCCACCAGGC TTTCCTGCTG AAACTGACCC
3251 GTCACCGTGT TACCTACGTT CCGCTGCTGG GTTCCCTGCG TACCGCTCAG
3301 ACCCAGCTGT CCCGTAAACT GCCGGGTACC ACCCTGACCG CTCTGGAAGC
3351 TGCTGCTAAC CCGGCTCTGC CGTCCGACTT CAAAACCATC CTGGAC
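
Sequences that differ only in codon choice should translate to the same amino acids, which gives a simple consistency check between the tables. The following is a minimal sketch, assuming the standard genetic code and using only the first ten codons of Tables 9A and 9B; the function name `translate` and the variable names are illustrative.

```python
# Standard genetic code in the conventional TCAG ordering ("*" marks stops).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TO_AA = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq):
    """Translate an in-frame DNA sequence, ignoring any whitespace."""
    seq = "".join(seq.upper().split())
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TO_AA[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First ten codons of each table, copied from the listings above.
table_9a_start = "ATG CCG CGC GCG CCG CGC TGC CGC GCG GTG"
table_9b_start = "ATGCCGCGTG CTCCGCGTTG CCGTGCTGTT"

same_protein = translate(table_9a_start) == translate(table_9b_start)
```

The same comparison can be run over complete sequences to flag transcription errors in any one table.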



Table 9C 
Yeast (All Genes)

ATG CCA AGA GCT CCA AGA TGT AGA GCT GTT AGA TCT TTG TTG AGA TCT CAT TAT
AGA GAA GTT TTG CCA TTG GCT ACT TTT GTT AGA AGA TTG GGT CCA CAA GGT TGG
AGA TTG GTT CAA AGA GGT GAT CCA GCT GCT TTT AGA GCT TTG GTT GCT CAA TGT
TTG GTT TGT GTT CCA TGG GAT GCT AGA CCA CCA CCA GCT GCT CCA TCT TTT AGA
CAA GTT TCT TGT TTG AAA GAA TTG GTT GCT AGA GTT TTG CAA AGA TTG TGT GAA
AGA GGT GCT AAA AAT GTT TTG GCT TTT GGT TTT GCT TTG TTG GAT GGT GCT AGA

GGT GGT CCA CCA GAA GCT
GTT ACT GAT GCT TTG AGA
GGT GAT GAT GTT TTG GTT
GCT CCA TCT TGT GCT TAT
GCT ACT CAA GCT AGA CCA
TGT GAA AGA GCT TGG AAT
CCA GCT CCA GGT GCT AGA
CCA AAA AGA CCA AGA AGA
CAA GGT TCT TGG GCT CAT
TGT GTT GTT TCT CCA GCT
TTG TCT GGT ACT AGA CAT
CCA CCA TCT ACT TCT AGA
TAT GCT GAA ACT AAA CAT
CCA TCT TTT TTG TTG TCT
GTT GAA ACT ATT TTT TTG
TTG CCA AGA TTG CCA CAA
TTG GGT AAT CAT GCT CAA
TTG AGA GCT GCT GTT ACT
GGT TCT GTT GCT GCT CCA
TTG TTG AGA CAA CAT TCT
TTG AGA AGA TTG GTT CCA
TTT TTG AGA AAT ACT AAA
TTG CAA GAA TTG ACT TGG
TCT CCA GGT GTT GGT TGT
TTG GCT AAA TTT TTG CAT
TCT TTT TTT TAT GTT ACT
AGA AAA TCT GTT TGG TCT
AGA GTT CAA TTG AGA GAA
AGA CCA GCT TTG TTG ACT
AGA CCA ATT GTT AAT ATG
AAA AGA GCT GAA AGA TTG
TAT GAA AGA GCT AGA AGA
GAT ATT CAT AGA GCT TGG
CCA CCA GAA TTG TAT TTT
CCA CAA GAT AGA TTG ACT
TAT TGT GTT AGA AGA TAT
AAA GCT TTT AAA TCT CAT
CAA TTT GTT GCT CAT TTG
GAA CAA TCT TCT TCT TTG
AGA TTT ATG TGT CAT CAT
CAA GGT ATT CCA CAA GGT
GGT GAT ATG GAA AAT AAA
AGA TTG GTT GAT GAT TTT
TTT TTG AGA ACT TTG GTT
AGA AAA ACT GTT GTT AAT
TTT GTT CAA ATG CCA GCT
ACT AGA ACT TTG GAA GTT
AGA GCT TCT TTG ACT TTT
AAA TTG TTT GGT GTT TTG
GTT AAT TCT TTG CAA ACT
GCT TAT AGA TTT CAT GCT
AAA AAT CCA ACT TTT TTT
TCT ATT TTG AAA GCT AAA
GGT CCA TTG CCA TCT GAA
AAA TTG ACT AGA CAT AGA



TTT ACT ACT TCT GTT AGA 
GGT TCT GGT GCT TGG GGT 
CAT TTG TTG GCT AGA TGT 
CAA GTT TGT GGT CCA CCA 
CCA CCA CAT GCT TCT GGT 
CAT TCT GTT AGA GAA GCT 
AGA AGA GGT GGT TCT GCT 
GGT GCT GCT CCA GAA CCA 
CCA GGT AGA ACT AGA GGT 
AGA CCA GCT GAA GAA GCT 
TCT CAT CCA TCT GTT GGT 
CCA CCA AGA CCA TGG GAT 
TTT TTG TAT TCT TCT GGT 
TCT TTG AGA CCA TCT TTG 
GGT TCT AGA CCA TGG ATG 
AGA TAT TGG CAA ATG AGA 
TGT CCA TAT GGT GTT TTG 
CCA GCT GCT GGT GTT TGT 
GAA GAA GAA GAT ACT GAT 
TCT CCA TGG CAA GTT TAT 
CCA GGT TTG TGG GGT TCT 
AAA TTT ATT TCT TTG GGT 
AAA ATG TCT GTT AGA GAT 
GTT CCA GCT GCT GAA CAT 
TGG TTG ATG TCT GTT TAT 
GAA ACT ACT TTT CAA AAA 
AAA TTG CAA TCT ATT GGT 
TTG TCT GAA GCT GAA GTT 
TCT AGA TTG AGA TTT ATT 
GAT TAT GTT GTT GGT GCT 
ACT TCT AGA GTT AAA GCT 
CCA GGT TTG TTG GGT GCT 
AGA ACT TTT GTT TTG AGA 
GTT AAA GTT GAT GTT ACT 
GAA GTT ATT GCT TCT ATT 
GCT GTT GTT CAA AAA GCT 
GTT TCT ACT TTG ACT GAT 
CAA GAA ACT TCT CCA TTG 
AAT GAA GCT TCT TCT GGT 
GCT GTT AGA ATT AGA GGT 
TCT ATT TTG TCT ACT TTG 
TTG TTT GCT GGT ATT AGA 
TTG TTG GTT ACT CCA CAT 
AGA GGT GTT CCA GAA TAT 
TTT CCA GTT GAA GAT GAA 
CAT GGT TTG TTT CCA TGG 
CAA TCT GAT TAT TCT TCT 
AAT AGA GGT TTT AAA GCT 
AGA TTG AAA TGT CAT TCT 
GTT TGT ACT AAT ATT TAT 
TGT GTT TTG CAA TTG CCA 
TTG AGA GTT ATT TCT GAT 
AAT GCT GGT ATG TCT TTG 
GCT GTT CAA TGG TTG TGT 
GTT ACT TAT GTT CCA TTG 



TCT 


TAT 


TTG 


CCA 


AAT 


ACT 


TTG 


TTG 


TTG 


AGA 


AGA 


GTT 


GCT 


TTG 


TTT 


GTT 


TTG 


GTT 


TTG 


TAT 


CAA 


TTG 


GGT 


GCT 


CCA 


AGA 


AGA 


AGA 


TTG 


GGT 


GGT 


GTT 


CCA 


TTG 


GGT 


TTG 


TCT 


AGA 


TCT 


TTG 


CCA 


TTG 


GAA 


AGA 


ACT 


CCA 


GTT 


GGT 


CCA 


TCT 


GAT 


AGA 


GGT 


TTT 


ACT 


TCT 


TTG 


GAA 


GGT 


GCT 


AGA 


CAA 


CAT 


CAT 


GCT 


GGT 


ACT 


CCA 


TGT 


CCA 


CCA 


GTT 


GAT 


AAA 


GAA 


CAA 


TTG 


AGA 


ACT 


GGT 


GCT 


AGA 


AGA 


TTG 


CCA 


GGT 


ACT 


CCA 


AGA 


AGA 


CCA 


TTG 


TTT 


TTG 


GAA 


TTG 


TTG 


AAA 


ACT 


CAT 


TGT 


CCA 


GCT 


AGA 


GAA 


AAA 


CCA 


CAA 


CCA 


AGA 


AGA 


TTG 


GTT 


CAA 


GGT 


TTT 


GTT 


AGA 


GCT 


TGT 


AGA 


CAT 


AAT 


GAA 


AGA 


AGA 


AAA 


CAT 


GCT 


AAA 


TTG 


TCT 


TGT 


GCT 


TGG 


TTG 


AGA 


AGA 


AGA 


TTG 


AGA 


GAA 


GAA 


ATT 


GTT 


GTT 


GAA 


TTG 


TTG 


AGA 


AAT 


AGA 


TTG 


TTT 


TTT 


TAT 


ATT 


AGA 


CAA 


CAT 


TTG 


AAA 


AGA 


CAA 


CAT 


AGA 


GAA 


GCT 


CCA 


AAA 


CCA 


GAT 


GGT 


TTG 


AGA 


ACT 


TTT 


AGA 


AGA 


GAA 


TTG 


TTT 


TCT 


GTT 


TTG 


AAT 


TCT 


GTT 


TTG 


GGT 


TTG 


GAT 


GTT 


AGA 


GCT 


CAA 


GAT 


CCA 


GGT 


GCT 


TAT 


GAT 


ACT 


ATT 


ATT 


AAA 


CCA 


CAA 


AAT 


ACT 


GCT 


CAT 


GGT 


CAT 


GTT 


AGA 


TTG 


CAA 


CCA 


TAT 


ATG 


AGA 


AGA 


GAT 


GCT 


GTT 


GTT 


ATT 


TTG 


TTT 


GAT 


GTT 


TTT 


TTG 


AAA 


TCT 


TAT 


GTT 


CAA 


TGT 


TTG 


TGT 


TCT 


TTG 


TGT 


TAT 


AGA 


GAT 


GGT 


TTG 


TTG 


TTG 


TTG 


ACT 


CAT 


GCT 


AAA 


ACT 


GGT 


TGT 


GTT 


GTT 


AAT 


TTG 


GCT 


TTG 


GGT 


GGT 


ACT 


GCT 


TGT 


GGT 


TTG 


TTG 


TTG 


GAT 


TAT 


GCT 


AGA 


ACT 


TCT 


ATT 


GGT 


AGA 


AAT 


ATG 


AGA 


AGA 


TTG 


TTT 


TTG 


GAT 


TTG 


CAA 


AAA 


ATT 


TTG 


TTG 


TTG 


CAA 


TTT 


CAT 


CAA 


CAA 


GTT 


TGG 


ACT 


GCT 


TCT 


TTG 


TGT 


TAT 


GGT 


GCT 


AAA 


GGT 


GCT 


GCT 


CAT 


CAA 


GCT 


TTT 


TTG 


TTG 


TTG 


GGT 


TCT 


TTG 


AGA 


ACT 











GCT 


CAA 


ACT 


CAA 


TTG 


TCT 


AGA 


AAA 


TTG 


CCA 


GGT 


ACT 


ACT 


TTG 


ACT 


GCT 


TTG 


GAA 






GCT 


GCT 


GCT 


AAT 


CCA 


GCT 


TTG 


CCA 


TCT 


GAT 


TTT 


AAA 


ACT 


ATT 


TTG 


GAT 








5 


















Table 9D 

Yeast (High Expressing Genes) 
















ATG 


CCA 


AGA 


GCT 


CCA 


AGA 


TGT 


AGA 


GCT 


GTT 


AGA 


TCT 


TTG 


TTG 


AGA 


TCT 


CAC 


TAC 






AGA 


GAA 


GTT 


TTG 


CCA 


TTG 


GCT 


ACT 


TTC 


GTT 


AGA 


AGA 


TTG 


GGT 


CCA 


CAA 


GGT 


TGG 






AGA 


TTG 


GTT 


CAA 


AGA 


GGT 


GAC 


CCA 


GCT 


GCT 


TTC 


AGA 


GCT 


TTG 


GTT 


GCT 


CAA 


TnT 

X u X 




10 


TTG 

X X U 


GTT 

UX X 


TGT 


GTT 


CCA 


TGG 


GAC 


GCT 


AGA 


CCA 


CCA 


CCA 


GCT 


GCT 


CCA 


TCT 


TTC 


AGA 






Lnn. 


fJTT 
Ul x 


TPT 

X L X 


TnT 

XU X 


TTn 

X X U 


AAG 


GAA 


TTG 


GTT 


GCT 


AGA 


GTT 


TTG 


PAA 

Lnn 


ana 

nun 


TTn 

X xu 


TnT 

lux 


naa 

unn. 






an a 

nUn 


nnT 

UU X 


npT 

UL X 


aan 


AAP 

nnL 


GTT 


TTG 


GCT 


TTC 


GGT 


TTC 


GCT 


TTG 


TTn 

X XU 


nap 

UnL 


nnT 

UU X 


nPT 

UL X 


ana 

nUn 






UU X 


nnT 

UU X 


ppa 

V* Ln 


PPA 


GAA 


GCT 


TTC 


ACT 


ACT 


TCT 


GTT 


AGA 


TCT 


TAP 
XnL 


TTn 

X Xu 


ppa 

LLn 


aap 

HnU 


aPT 

AL 1 






OX X 


aPT 

nL X 


GAC 


GCT 


TTG 


AGA 


GGT 


TCT 


GGT 


GCT 


TGG 


GGT 


TTG 


TTn 

X XU 


TTn 

x xu 


ana 

nun 


ana 

nUn 


nTT 

ul 1 




15 


GGT 


GAC 


GAC 


GTT 


TTG 


GTT 


CAC 


TTG 


TTG 


GCT 


AGA 


TGT 


GCT 


TTn 

x xu 


TTP 
X X v_ 


nTT 

U X X 


TTn 

1 Xu 


nTT 

ul 1 






nPT 

UL X 


ppa 

LLn 


TPT 

X L X 


TnT 

X V71 


GCT 


TAC 


CAA 


GTT 


TGT 


GGT 


CCA 


CCA 


TTG 


Tap 


paa 


TTn 

x xu 


nnT 

uul 


nPT 

UL 1 






nPT 

UL X 


aPT 

nO X 


PAA 
win 


nPT 

UL x 


AGA 


CCA 


CCA 


CCA 


CAC 


GCT 


TCT 


GGT 


CCA 


ana 

nUn 


ana 

nUn 


ana 

nun 


TTn 

X Xu 


nnT 

UU 1 






TnT 

lul 


naa 


ana 

nun 


nPT 

X 


Tnn 

xuu 


AAP 

nnL 


CAC 


TCT 


GTT 


AGA 


GAA 


GCT 


GGT 


nTT 

UX X 


ppa 


TTn 

X XU 


nnT 

UU 1 


TTn 

1 lu 






ppa 


nPT 

UL X 


ppa 

LLn. 


GGT 


GCT 


AGA 


AGA 


AGA 


GGT 


GGT 


TCT 


GCT 


TCT 


ana 


TPT 


TTn 

x xu 


ppa 

LLn 


TTn 

1 lu 


;3 


20 


PPA 
LLn 


aan 

n>iu 


ana 

nun 


ppa 


AGA 


AGA 


GGT 


GCT 


GCT 


CCA 


GAA 


CCA 


GAA 


ana 

nun 


aPT 

nL. X 


ppa 


nTT 

u X 1 


nnT 

uul 






paa 


UU X 


TPT 


Tnn 

X UU 


npT 

X 


pap 


CCA 


GGT 


AGA 


ACT 


AGA 


GGT 


CCA 


TPT 
X L X 


nap 


ana 

nun 


nnT 

uu x 


1 1 L 


id 




TnT 
-LUX 


P.TT 
Ul X 


nTT 

vJJ. X 


TPT 


ppa 
LLn 


nPT 

UU X 


AGA 


CCA 


GCT 


GAA 


GAA 


GCT 


ACT 


TPT 


TTn 

X XU 


na a 

Unn 


nnT 

uul 


PPT 
uL 1 


'4 




rnrnr* 

x x u 


TPT 


GGT 


ACT 


AGA 


CAC 


TCT 


CAC 


CCA 


TCT 


GTT 


GGT 


AGA 


CAA 


pap 

LnU 


pap 


nPT 

UL X 


nnT 

Uu X 






CCA 


CCA 


TCT 


ACT 


TCT 


AGA 


CCA 


CCA 


AGA 


CCA 


TGG 


GAC 


ACT 


CCA 


TGT 


CCA 


CCA 


GTT 


• = 


25 


Tan 

XnL 


GPT 

ul x 


GAA 


ACT 


AAG 


CAC 


TTC 


TTG 


TAC 


TCT 


TCT 


GGT 


GAC 


AAn 


naa 

Unn 


PAA 
Lnn 


TTn 

X xu 


ana 

nun 






PPA 

LLn 


TPT 


TTC 


TTG 


TTG 


TCT 


TCT 


TTG 


AGA 


CCA 


TCT 


TTG 


ACT 


nnT 

ou x 


nPT 

UL X 


ana 

nun 


Ana 

nun 


TTn 

1 X U 


i 




GTT 


GAA 


ACT 


ATT 


TTC 


TTG 


GGT 


TCT 


AGA 


CCA 


TGG 


ATG 


CCA 


GGT 


ACT 


CCA 


AGA 


AGA 






TTG 


CCA 


AGA 


TTG 


CCA 


CAA 


AGA 


TAC 


TGG 


CAA 


ATG 


AGA 


CCA 


TTG 


TTC 


TTG 


GAA 


TTn 

X xu 






TTG 


GGT 


AAC 


CAC 


GCT 


CAA 


TGT 


CCA 


TAC 


GGT 


GTT 


TTG 


TTG 


AAG 


ACT 


CAC 


TGT 


CCA 


A 


30 


TTG 


AGA 


GCT 


GCT 


GTT 


ACT 


CCA 


GCT 


GCT 


GGT 


GTT 


TGT 


GCT 


AGA 


GAA 


AAG 


CCA 


CAA 






GGT 


TPT 

X L X 


GTT 


nPT 

UL X 


GCT 


CCA 


GAA 


GAA 


GAA 


GAC 


ACT 


GAC 


CCA 


ana 

nun. 


ana 

nun 


TTn 

1 Xu 


PTT 
ul 1 


paa 

Lnn 






TTn 
X X U 


TTn 


ana 

nUn 


paa 

Lnn 


PAP 

LnU 


TPT 

X L X 


TCT 


CCA 


TGG 


CAA 


GTT 


TAC 


GGT 


TTP 


nTT 

U X X 


ana 

nUn 


nPT 

uL X 


TnT 

lul 






TTn 
x lu 


ana 

nun 


ana 


TTn 

X X U 


nTT 

U X X 


PPA 
LLn 


CCA 


GGT 


TTG 


TGG 


GGT 


TCT 


AGA 


pap 

LnL 


aap 

nnA— 


naa 


ana 

AuA 


ana 

AuA 






TTP 


TTn 
X X u 


ana 

nun 


aap 


APT 
nL X 


aan 

nnu 


AAG 


TTC 


ATT 


TCT 


TTG 


GGT 


AAG 


pap 


PPT 
uV_ 1 


aap 

nnu 


TTP 
1 lu 


TPT 
iLl 






1 xu 


paa 

Lnn 


naa 


TTn 

X X U 


aPT 

nL X 


TGG 


AAG 


ATG 


TCT 


GTT 


AGA 


GAC 


TGT 


nPT 
ut. x 


Tnn 

Xuu 


TTn 

X Xu 


ana 

nun 


ana 

nun 






i. L X 


ppa 

LLn 


nnT 


nTT 

UX X 


nnT 


TnT 


GTT 


CCA 


GCT 


GCT 


GAA 


CAC 


AGA 


TTn 

1 lu 


ana 

nun 


naa 

unn 


na a 

uAA 


ATT 
Al 1 






TTn 
x x u 


npT 

UL x 


aan 

nnU 


TTP 
X X L 


TTn 


pap 

UnL 


TGG 


TTG 


ATG 


TCT 


GTT 


TAC 


GTT 


PTT 


paa 

unn 


TTP 
1 lu 


TTP 
1 lu 


apa 

AuA 






•PPT 


TTP 
X xl 


TTP 


TAP 
XnL 


nTT 

U X X 


aPT 

nL X 


GAA 


ACT 


ACT 


TTC 


CAA 


AAG 


AAC 


ana 

nun 


TTf2 
X Xu 


TTP 


TTP 
1 1 L 


Tap 

1AL 






ana 

nun 


aan 


TPT 

X L X 


nTT 

U X X 


Tnn 

X UU 


TPT 


AAG 


TTG 


CAA 


TCT 


ATT 


GGT 


ATT 


ana 

nUn 


paa 

V,nn 


pap 

LnC 


TTfi 
1 lu 


aan 

nnu 




40 


ana 

nun 


nTT 


paa 

von 


TTn 

X XU 


ana 

nun 


naa 

unn 


TTG 


TCT 


GAA 


GCT 


GAA 


GTT 


AGA 


paa 


pap 


ana 

nun 


paa 

unA. 


PPT 
uL 1 






ana 

nuA 


ppa 

LLn 


nPT 

X 


TTn 

X X U 


TTn 

X Xu 


aPT 

nL X 


TCT 


AGA 


TTG 


AGA 


TTC 


ATT 


CCA 


aan 


ppa 


nap 

UnL 


pnT 

uul 


TTn 
1 lu 






ana 

AuA 


PPA 


ATT 


ul 1 


aap 

nnL 


a tp 


GAC 


TAC 


GTT 


GTT 


GGT 


GCT 


AGA 


7V fiT 


TTC 


AuA 


7V/"*7\ 

AuA 


uAA 






J, 7\p 

nAu 




PPT 

ul i 


paa 


ana 

AuA 


TTP 


ACT 


TCT 


AGA 


GTT 


AAG 


GCT 


TTG 


i XL 


TPT 
itl 


ul 1 


1 lu 


AAL 








paa 

Win 


apa 

nun 


PPT 
uL 1 


ana 


ana 

AufV 


CCA 


GGT 


TTG 


TTG 


GGT 


GCT 


TCT 


ul 1 


1 lu 


uul 


1 itj 


LtAL 






pap 


ill 1 


pap 


apa 

AuA 


PPT 

uL X 


Tnp 


AGA 


ACT 


TTC 


GTT 


TTG 


AGA 


GTT 


AuA 


(jLI 


C-nn 


uAL 


r*r*a 
LLA 






ppa 


ppa 


naa 
unn 


TTn 


Tap 
inL 


TTP 
1 1 L 


GTT 


AAG 


GTT 


GAC 


GTT 


ACT 


GGT 


uLl 


rp7\ ft 
1 AL 


bAL 


ALT 


Al x 






LlA 




pap 

UAL. 


AuA 


x lb 


n r^T 
AL1 


GAA 


GTT 


ATT 


GCT 


TCT 


ATT 


ATT 


T\ 71 

AAG 


LLA 


CAA 


TV 7\ 

AAC 


ALT 






Tap 


TnT 


nTT 

Ul X 


ana 

nun. 


ana 
nun 


Tap 

InL 


GCT 


GTT 


GTT 


CAA 


AAG 


GCT 


GCT 


LAL 


uul 


UAL. 


/irnrp 
ul 1 


apa 
AuA 








PPT* 
uL X 


TTC 


aap 

AAu 


TPT 


L/iL 


GTT 


TCT 


ACT 


TTG 


ACT 


GAC 


TTG 


CAA 


CCA 


TAC 


ATG 


AGA 




50 


CAA 


TTC 


GTT 


GCT 


CAC 


TTG 


CAA 


GAA 


ACT 


TCT 


CCA 


TTG 


AGA 


GAC 


GCT 


GTT 


GTT 


ATT 






GAA 


CAA 


TCT 


TCT 


TCT 


TTG 


AAC 


GAA 


GCT 


TCT 


TCT 


GGT 


TTG 


TTC 


GAC 


GTT 


TTC 


TTG 






AGA 


TTC 


ATG 


TGT 


CAC 


CAC 


GCT 


GTT 


AGA 


ATT 


AGA 


GGT 


AAG 


TCT 


TAC 


GTT 


CAA 


TGT 






CAA 


GGT 


ATT 


CCA 


CAA 


GGT 


TCT 


ATT 


TTG 


TCT 


ACT 


TTG 


TTG 


TGT 


TCT 


TTG 


TGT 


TAC 






GGT 


GAC 


ATG 


GAA 


AAC 


AAG 


TTG 


TTC 


GCT 


GGT 


ATT 


AGA 


AGA 


GAC 


GGT 


TTG 


TTG 


TTG 




55 


AGA 


TTG 


GTT 


GAC 


GAC 


TTC 


TTG 


TTG 


GTT 


ACT 


CCA 


CAC 


TTG 


ACT 


CAC 


GCT 


AAG 


ACT 







TTC 


TTG 


AGA 


ACT 


TTG 


GTT 


AGA 


GGT 


GTT 


CCA 


GAA 


TAC 


GGT 


TGT 


GTT 


GTT 


AAC 


TTG 


AGA 


AAG 


ACT 


GTT 


GTT 


AAC 


TTC 


CCA 


GTT 


GAA 


GAC 


GAA 


GCT 


TTG 


GGT 


GGT 


ACT 


GCT 


TTC 


GTT 


CAA 


ATG 


CCA 


GCT 


CAC 


GGT 


TTG 


TTC 


CCA 


TGG 


TGT 


GGT 


TTG 


TTG 


TTG 


GAC 


ACT 


AGA 


ACT 


TTG 


GAA 


GTT 


CAA 


TCT 


GAC 


TAC 


TCT 


TCT 


TAC 


GCT 


AGA 


ACT 


TCT 


ATT 


AGA 


GCT 


TCT 


TTG 


ACT 


TTC 


AAC 


AGA 


GGT 


TTC 


AAG 


GCT 


GGT 


AGA 


AAC 


ATG 


AGA 


AGA 


AAG 


TTG 


TTC 


GGT 


GTT 


TTG 


AGA 


TTG 


AAG 


TGT 


CAC 


TCT 


TTG 


TTC 


TTG 


GAC 


TTG 


CAA 


GTT 


AAC 


TCT 


TTG 


CAA 


ACT 


GTT 


TGT 


ACT 


AAC 


ATT 


TAC 


AAG 


ATT 


TTG 


TTG 


TTG 


CAA 


GCT 


TAC 


AGA 


TTC 


CAC 


GCT 


TGT 


GTT 


TTG 


CAA 


TTG 


CCA 


TTC 


CAC 


CAA 


CAA 


GTT 


TGG 


AAG 


AAC 


CCA 


ACT 


TTC 


TTC 


TTG 


AGA 


GTT 


ATT 


TCT 


GAC 


ACT 


GCT 


TCT 


TTG 


TGT 


TAC 


TCT 


ATT 


TTG 


AAG 


GCT 


AAG 


AAC 


GCT 


GGT 


ATG 


TCT 


TTG 


GGT 


GCT 


AAG 


GGT 


GCT 


GCT 


GGT 


CCA 


TTG 


CCA 


TCT 


GAA 


GCT 


GTT 


CAA 


TGG 


TTG 


TGT 


CAC 


CAA 


GCT 


TTC 


TTG 


TTG 


AAG 


TTG 


ACT 


AGA 


CAC 


AGA 


GTT 


ACT 


TAC 


GTT 


CCA 


TTG 


TTG 


GGT 


TCT 


TTG 


AGA 


ACT 


GCT 


CAA 


ACT 


CAA 


TTG 


TCT 


AGA 


AAG 


TTG 


CCA 


GGT 


ACT 


ACT 


TTG 


ACT 


GCT 


TTG 


GAA 


GCT 


GCT 


GCT 


AAC 


CCA 


GCT 


TTG 


CCA 


TCT 


GAC 


TTC 


AAG 


ACT 


ATT 


TTG 


GAC 
Table 9E 

"Generic" hTRT Protein Encoding Sequence 

20 ATG CCA CGT GCC CCA CGT TGT CGT GCC GTT CGT TCT TTG TTG CGT TCT CAC TAC CGT 
GAA GTT TTG CCA TTG GCC ACC TTC GTT CGT CGT TTG GGT CCA CAA GGT TGG CGT TTG 
GTT CAA CGT GGT GAT CCA GCC GCC TTC CGT GCC TTG GTT GCC CAA TGT TTG GTT TGT 
GTT CCA TGG GAT GCC CGT CCA CCA CCA GCC GCC CCA TCT TTC CGT CAA GTT TCT TGT 
TTG AAA GAA TTG GTT GCC CGT GTT TTG CAA CGT TTG TGT GAA CGT GGT GCC AAA AAC 

25 GTT TTG GCC TTC GGT TTC GCC TTG TTG GAT GGT GCC CGT GGT GGT CCA CCA GAA GCC 
TTC ACC ACC TCT GTT CGT TCT TAC TTG CCA AAC ACC GTT ACC GAT GCC TTG CGT GGT 
TCT GGT GCC TGG GGT TTG TTG TTG CGT CGT GTT GGT GAT GAT GTT TTG GTT CAC TTG 
TTG GCC CGT TGT GCC TTG TTC GTT TTG GTT GCC CCA TCT TGT GCC TAC CAA GTT TGT 
GGT CCA CCA TTG TAC CAA TTG GGT GCC GCC ACC CAA GCC CGT CCA CCA CCA CAC GCC 

30 TCT GGT CCA CGT CGT CGT TTG GGT TGT GAA CGT GCC TGG AAC CAC TCT GTT CGT GAA 
GCC GGT GTT CCA TTG GGT TTG CCA GCC CCA GGT GCC CGT CGT CGT GGT GGT TCT GCC 
TCT CGT TCT TTG CCA TTG CCA AAA CGT CCA CGT CGT GGT GCC GCC CCA GAA CCA GAA 
CGT ACC CCA GTT GGT CAA GGT TCT TGG GCC CAC CCA GGT CGT ACC CGT GGT CCA TCT 
GAT CGT GGT TTC TGT GTT GTT TCT CCA GCC CGT CCA GCC GAA GAA GCC ACC TCT TTG 

35 GAA GGT GCC TTG TCT GGT ACC CGT CAC TCT CAC CCA TCT GTT GGT CGT CAA CAC CAC 
GCC GGT CCA CCA TCT ACC TCT CGT CCA CCA CGT CCA TGG GAT ACC CCA TGT CCA CCA 
GTT TAC GCC GAA ACC AAA CAC TTC TTG TAC TCT TCT GGT GAT AAA GAA CAA TTG CGT 
CCA TCT TTC TTG TTG TCT TCT TTG CGT CCA TCT TTG ACC GGT GCC CGT CGT TTG GTT 
GAA ACC ATT TTC TTG GGT TCT CGT CCA TGG ATG CCA GGT ACC CCA CGT CGT TTG CCA 

40 CGT TTG CCA CAA CGT TAC TGG CAA ATG CGT CCA TTG TTC TTG GAA TTG TTG GGT AAC 
CAC GCC CAA TGT CCA TAC GGT GTT TTG TTG AAA ACC CAC TGT CCA TTG CGT GCC GCC 
GTT ACC CCA GCC GCC GGT GTT TGT GCC CGT GAA AAA CCA CAA GGT TCT GTT GCC GCC 
CCA GAA GAA GAA GAT ACC GAT CCA CGT CGT TTG GTT CAA TTG TTG CGT CAA CAC TCT 
TCT CCA TGG CAA GTT TAC GGT TTC GTT CGT GCC TGT TTG CGT CGT TTG GTT CCA CCA 

45 GGT TTG TGG GGT TCT CGT CAC AAC GAA CGT CGT TTC TTG CGT AAC ACC AAA AAA TTC 
ATT TCT TTG GGT AAA CAC GCC AAA TTG TCT TTG CAA GAA TTG ACC TGG AAA ATG TCT 
GTT CGT GAT TGT GCC TGG TTG CGT CGT TCT CCA GGT GTT GGT TGT GTT CCA GCC GCC 
GAA CAC CGT TTG CGT GAA GAA ATT TTG GCC AAA TTC TTG CAC TGG TTG ATG TCT GTT 
TAC GTT GTT GAA TTG TTG CGT TCT TTC TTC TAC GTT ACC GAA ACC ACC TTC CAA AAA 

50 AAC CGT TTG TTC TTC TAC CGT AAA TCT GTT TGG TCT AAA TTG CAA TCT ATT GGT ATT 
CGT CAA CAC TTG AAA CGT GTT CAA TTG CGT GAA TTG TCT GAA GCC GAA GTT CGT CAA 
CAC CGT GAA GCC CGT CCA GCC TTG TTG ACC TCT CGT TTG CGT TTC ATT CCA AAA CCA 
GAT GGT TTG CGT CCA ATT GTT AAC ATG GAT TAC GTT GTT GGT GCC CGT ACC TTC CGT 
CGT GAA AAA CGT GCC GAA CGT TTG ACC TCT CGT GTT AAA GCC TTG TTC TCT GTT TTG 

55 AAC TAC GAA CGT GCC CGT CGT CCA GGT TTG TTG GGT GCC TCT GTT TTG GGT TTG GAT 
GAT 


ATT 
CGT 
Following determination of the desired nucleotide sequence for the hTRT 
protein-encoding polynucleotide, the polynucleotide can be made by any suitable method, 
including de novo chemical synthesis, directed mutagenesis of a synthetic or naturally 
occurring TRT gene or cDNA, or a combination of these methods. In one exemplary 
embodiment, oligonucleotides (typically 50-100 bases in length) are synthesized with a 5' 
phosphate group and include approximately 10-base overhangs (relative to adjacent 
oligonucleotides in the assembled gene) to direct subsequent ligations. Following 
purification and desalting, each oligonucleotide is annealed to its complement (e.g., by 
combining pairs of oligonucleotides in equimolar amounts in a neutral pH buffer with 
50-200 mM NaCl and 0.5 mM MgCl2). Annealing may be monitored by native PAGE. The 
resulting double-stranded oligonucleotides are ligated to their neighbors in pairs. After each 
ligation the products are gel-purified, then ligated to the appropriate (neighboring) 
double-stranded DNAs. In this manner, fragments of approximately 600-800 base pairs are built up. 
These intermediate fragments are then cloned into vectors and sequenced. The fragments are 
then combined into a single vector (resulting in a vector containing a polynucleotide with the 
desired hTRT protein-encoding sequence). This step is facilitated by using restriction sites 
present in, or engineered into, the polynucleotide sequence. Alternatively, the fragments can 
be built up by ligation until the complete cDNA is assembled and the assembled sequence 
cloned into a vector. Numerous other alternative methods and approaches will be apparent to 
those of skill in the art. 
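
The staggered layout of top- and bottom-strand oligonucleotides described above can be sketched in a few lines of Python. This is an illustration only, under stated assumptions: the function names `reverse_complement` and `tile_duplex` and the default 90-base oligonucleotide length are hypothetical choices, not part of the disclosed method; the 10-base stagger follows the approximate overhang given in the text.

```python
# Hypothetical sketch: split a designed sequence into top-strand oligos and
# bottom-strand oligos whose boundaries are shifted to leave short overhangs.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper- or lower-case DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_duplex(seq: str, oligo_len: int = 90, overhang: int = 10):
    """Return (top, bottom) lists of oligonucleotides, each written 5' to 3'.

    Top oligos tile `seq` from position 0 in steps of `oligo_len`.
    Bottom oligos tile the complementary strand, with every internal
    boundary shifted by `overhang` bases relative to the top strand,
    so each annealed pair carries a single-stranded overhang.
    """
    seq = seq.upper()
    top = [seq[i:i + oligo_len] for i in range(0, len(seq), oligo_len)]
    # Bottom-strand boundaries, expressed in top-strand coordinates.
    cuts = [0] + list(range(oligo_len + overhang, len(seq), oligo_len)) + [len(seq)]
    bottom = [reverse_complement(seq[a:b]) for a, b in zip(cuts, cuts[1:])]
    return top, bottom


if __name__ == "__main__":
    top, bottom = tile_duplex("ACGT" * 50)
    for label, oligos in (("T", top), ("B", bottom)):
        for n, oligo in enumerate(oligos, start=1):
            print(f"{n}{label} {oligo}")
```

The same relationship can be seen in Table 10A: the first ten bases of oligonucleotide 1B (CCAGCGGCAG) are the reverse complement of the first ten bases of oligonucleotide 2T (CTGCCGCTGG), so the annealed 1T/1B pair presents an overhang that pairs with the 2T/2B duplex.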

Table 10A shows an exemplary set of oligonucleotides that can be used to 
produce a polynucleotide, shown in Table 10B, that employs a codon distribution 
preferentially used by highly expressed genes in E. coli. The sequence in Table 10B contains 
silent changes to some codons to introduce useful restriction sites spaced every 300-800 base 
pairs, to facilitate subcloning and modification. Oligonucleotide pairs for the initial 
annealing steps are indicated by the labels "T" (top strand) and "B" (bottom strand). The 
full-length polynucleotide (Table 10B) encodes the hTRT protein (with the start codon at 
nucleotides 28-30) and contains Sac I and Xho I sites at the termini flanking the open reading 
frame, which are useful for cloning into a variety of vectors (e.g., pBluescript II KS, 
Stratagene Inc., San Diego CA). Once cloned into an appropriate vector, the hTRT sequence 
may be expressed, modified (e.g., by site-directed or cassette mutagenesis), subcloned, or 
otherwise used or manipulated. In one embodiment, the polynucleotide is subcloned into a 
pET vector containing a T7 RNA polymerase promoter (Novagen Inc., Madison, WI) and 
introduced into an E. coli strain having an inducible T7 polymerase (Novagen Inc., Madison, 
WI). One advantage of the pET system is that the E. coli culture may be grown before the T7 
RNA polymerase gene is induced, resulting in very high levels of transcription and 
minimizing any potential detrimental effect of the expressed protein on the cells. 
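
The features named in the preceding paragraph (Sac I and Xho I sites at the termini, start codon at nucleotides 28-30) lend themselves to a quick consistency check. The sketch below is illustrative only: the helper names `find_sites` and `translate` are hypothetical, the two fragments are copied from the 5' and 3' ends of Table 10B, and the translation assumes the standard genetic code.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Recognition sequences of the two enzymes mentioned in the text.
RECOGNITION_SITES = {"SacI": "GAGCTC", "XhoI": "CTCGAG"}

# First 61 and last 35 nucleotides of the sequence in Table 10B.
FIVE_PRIME = "GCACGCGGGAGCTCTAGAGTCGACCATATGCCGCGTGCTCCGCGTTGCCGTGCTGTTCGTT"
THREE_PRIME = "CCTGGACTAATGGATCCGAATTCCTCGAGGCACGC"


def find_sites(seq, sites=RECOGNITION_SITES):
    """Return the 1-based start positions of each recognition site in seq."""
    seq = seq.upper()
    return {
        name: [i + 1 for i in range(len(seq) - len(site) + 1)
               if seq.startswith(site, i)]
        for name, site in sites.items()
    }


def translate(seq, start):
    """Translate from 1-based position `start` up to a stop codon or the end."""
    seq = seq.upper()
    protein = []
    for i in range(start - 1, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(find_sites(FIVE_PRIME))
    print(find_sites(THREE_PRIME))
    print(translate(FIVE_PRIME, 28))
```

Translating from nucleotide 28 of the 5' fragment gives the same leading residues (Met-Pro-Arg-Ala-Pro-Arg-Cys-Arg-Ala-Val-Arg) that the first codons of Tables 9D and 9E encode, which is the expected result if only synonymous codons were changed.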

TABLE 10 

SYNTHESIS OF hTRT POLYNUCLEOTIDE HAVING ALTERNATIVE 

CODON DISTRIBUTION 

Table 10A: Oligonucleotides 

1B CCAGCGGCAGAACTTCGCGATAGTGGGAACGCAGCAGGGAACGAACAGCACGGCAACGCG 

GAGCACGCGGCATATGGTCGACTCTAGAGCTCCCGCGTGC 
1T GCACGCGGGAGCTCTAGAGTCGACCATATGCCGCGTGCTCCGCGTTGCCGTGCTGTTCGTTC 

CCTGCTGCGTTCCCACTATCGCGAAGTT 
2B GGCACTGAGCAACCAGAGCACGGAAAGCAGCCGGGTCACCACGCTGAACCAGACGCCAAC 

CCTGCGGGCCCAGACGACGAACGAAGGTAG 
2T CTGCCGCTGGCTACCTTCGTTCGTCGTCTGGGCCCGCAGGGTTGGCGTCTGGTTCAGCGTGG 
TGACCCGGCTGCTTTCCGTGCTCTGGTT 
3B GAACACGAGCAACCAGTTCTTTCAGGCAGGAAACCTGACGGAAGGACGGAGCAGCCGGCG 

GCGGACGAGCGTCCCACGGAACGCAAACCA 
3T GCTCAGTGCCTGGTTTGCGTTCCGTGGGACGCTCGTCCGCCGCCGGCTGCTCCGTCCTTCCGT 

CAGGTTTCCTGCCTGAAAGAACTGGTT 

4B ATGCTTCCGGCGGACCACCACGAGCACCGTCCAGCAGAGCGAAACCGAAAGCCAGAACGTT 

TTTAGCACCACGTTCGCACAGACGCTGCA 
4T GCTCGTGTTCTGCAGCGTCTGTGCGAACGTGGTGCTAAAAACGTTCTGGCTTTCGGTTTCGC 

TCTGCTGGACGGTGCTCGTGGTGGTCCG 
5B CAACACGACGCAGCAGCAGACCCCAAGCACCGGAACCACGCAGAGCGTCGGTAACGGTGTT 

CGGCAGGTAGGAACGAACGGAGGTGGTGA 
5T CCGGAAGCATTCACCACCTCCGTTCGTTCCTACCTGCCGAACACCGTTACCGACGCTCTGCG 

TGGTTCCGGTGCTTGGGGTCTGCTGCTG 
6B GCGGCGGACCACAAACCTGGTAAGCGCAGGACGGAGCAACCAGAACGAACAGAGCGCAAC 

GAGCCAGCAGGTGAACCAGAACGTCGTCAC 

6T CGTCGTGTTGGTGACGACGTTCTGGTTCACCTGCTGGCTCGTTGCGCTCTGTTCGTTCTGGTT 

GCTCCGTCCTGCGCTTACCAGGTTTGT 
7B GGTTCCAAGCACGTTCGCAACCCAGACGACGACGCGGACCGGAAGCGTGCGGCGGCGGAC 

GAGCCTGGGTAGCAGCACCCAGCTGGTACA 
7T GGTCCGCCGCTGTACCAGCTGGGTGCTGCTACCCAGGCTCGTCCGCCGCCGCACGCTTCCGG 
TCCGCGTCGTCGTCTGGGTTGCGAACGT 
8B GCAGCGGCAGGGAACGGGAAGCGGAACCACCACGACGACGAGCACCCGGAGCCGGCAGAC 

CCAGCGGAACACCAGCTTCACGAACGGAGT 
8T GCTTGGAACCACTCCGTTCGTGAAGCTGGTGTTCCGCTGGGTCTGCCGGCTCCGGGTGCTCG 

TCGTCGTGGTGGTTCCGCTTCCCGTTCC 

9B GACCACGGGTACGACCCGGGTGAGCCCAGGAACCCTGACCAACCGGGGTACGTTCCGGTTC 

CGGAGCAGCACCACGACGCGGACGTTTCG 
9T CTGCCGCTGCCGAAACGTCCGCGTCGTGGTGCTGCTCCGGAACCGGAACGTACCCCGGTTGG 

TCAGGGTTCCTGGGCTCACCCGGGTCGT 
10B AGTGACGGGTGCCGGACAGAGCACCTTCCAGGGAGGTAGCTTCTTCAGCCGGACGAGCCGG 

GGAAACAACGCAGAAACCACGGTCGGACG 
10T ACCCGTGGTCCGTCCGACCGTGGTTTCTGCGTTGTTTCCCCGGCTCGTCCGGCTGAAGAAGC 

TACCTCCCTGGAAGGTGCTCTGTCCGGC 
11B AAACCGGCGGGCACGGGGTGTCCCACGGACGCGGCGGACGGGAGGTGGACGGCGGACCAG 

CGTGGTGCTGACGACCAACGGACGGGTGGG 

11T ACCCGTCACTCCCACCCGTCCGTTGGTCGTCAGCACCACGCTGGTCCGCCGTCCACCTCCCG 

TCCGCCGCGTCCGTGGGACACCCCGTGC 
12B TCAGGGACGGACGCAGGGAGGACAGCAGGAAGGACGGACGCAGCTGTTCTTTGTCACCGG 

AGGAGTACAGGAAGTGTTTGGTTTCAGCGT 
12T CCGCCGGTTTACGCTGAAACCAAACACTTCCTGTACTCCTCCGGTGACAAAGAACAGCTGCG 

TCCGTCCTTCCTGCTGTCCTCCCTGCGT 
13B GCTGCGGCAGACGCGGCAGACGACGCGGGGTGCCCGGCATCCACGGACGGGAACCCAGGA 

AGATAGTTTCAACCAGACGACGAGCACCGG 
13T CCGTCCCTGACCGGTGCTCGTCGTCTGGTTGAAACTATCTTCCTGGGTTCCCGTCCGTGGATG 

CCGGGCACCCCGCGTCGTCTGCCGCGT 

14B GCGGGCAGTGGGTTTTCAGCAGAACACCATACGGGCACTGAGCGTGGTTGCCCAGCAGTTC 

CAGGAACAGCGGACGCATCTGCCAGTAAC 
14T CTGCCGCAGCGTTACTGGCAGATGCGTCCGCTGTTCCTGGAACTGCTGGGCAACCACGCTCA 

GTGCCCGTATGGTGTTCTGCTGAAAACC 

15B GGTCGGTATCTTCTTCTTCCGGAGCAGCAACGGAACCCTGCGGTTTTTCACGAGCGCAAACA 

CCAGCAGCCGGGGTAACAGCAGCACGCA 
15T CACTGCCCGCTGCGTGCTGCTGTTACCCCGGCTGCTGGTGTTTGCGCTCGTGAAAAACCGCA 

GGGTTCCGTTGCTGCTCCGGAAGAAGAA 
16B GCGGAACCAGACGACGCAGGCATGCACGAACGAAACCGTAAACCTGCCACGGGGAGGAGT 

GCTGACGCAGCAGCTGAACCAGACGACGCG 
16T GATACCGACCCGCGTCGTCTGGTTCAGCTGCTGCGTCAGCACTCCTCCCCGTGGCAGGTTTA 

CGGTTTCGTTCGTGCATGCCTGCGTCGT 
17B GGGACAGTTTAGCGTGTTTACCCAGGGAGATGAATTTTTTGGTGTTACGCAGGAAACGACGT 

TCGTTGTGACGGGAACCCCACAGACCCG 
17T CTGGTTCCGCCGGGTCTGTGGGGTTCCCGTCACAACGAACGTCGTTTCCTGCGTAACACCAA 

AAAATTCATCTCCCTGGGTAAACACGCT 
18B GGTGTTCAGCAGCCGGAACGCAACCAACACCCGGAGAACGACGCAGCCAAGCGCAGTCAC 

GAACGGACATTTTCCAGGTCAGTTCCTGCA 
18T AAACTGTCCCTGCAGGAACTGACCTGGAAAATGTCCGTTCGTGACTGCGCTTGGCTGCGTCG 
TTCTCCGGGTGTTGGTTGCGTTCCGGCT 

19B CGGTAACGTAGAAGAAGGAACGCAGCAGTTCAACAACGTATACGGACATCAGCCAGTGCA 

GGAATTTAGCCAGGATTTCTTCACGCAGAC 
19T GCTGAACACCGTCTGCGTGAAGAAATCCTGGCTAAATTCCTGCACTGGCTGATGTCCGTATA 

CGTTGTTGAACTGCTGCGTTCCTTCTTC 
20B GTTTCAGGTGCTGACGGATACCGATGGACTGCAGTTTGGACCAAACGGATTTACGGTAGAA 

GAACAGACGGTTTTTCTGGAAGGTGGTTT 
20T TACGTTACCGAAACCACCTTCCAGAAAAACCGTCTGTTCTTCTACCGTAAATCCGTTTGGTC 

CAAACTGCAGTCCATCGGTATCCGTCAG 
21B GATGAAACGCAGACGGGAGGTCAGCAGAGCCGGACGAGCTTCACGGTGCTGACGAACTTCA 
GCTTCGGACAGTTCACGCAGCTGAACAC 

21T CACCTGAAACGTGTTCAGCTGCGTGAACTGTCCGAAGCTGAAGTTCGTCAGCACCGTGAAG 

CTCGTCCGGCTCTGCTGACCTCCCGTCTG 
22B TCAGACGCTCAGCACGTTTTTCACGACGGAAGGTACGAGCACCAACAACGTAGTCCATGTTT 

ACGATCGGACGCAGACCGTCCGGTTTCG 
22T CGTTTCATCCCGAAACCGGACGGTCTGCGTCCGATCGTAAACATGGACTACGTTGTTGGTGC 

TCGTACCTTCCGTCGTGAAAAACGTGCT 
23B CGTCCAGACCCAGAACGGAAGCACCCAGCAGACCCGGACGACGAGCACGTTCGTAGTTCAG 

AACGGAGAACAGAGCTTTAACACGGGAGG 
23T GAGCGTCTGACCTCCCGTGTTAAAGCTCTGTTCTCCGTTCTGAACTACGAACGTGCTCGTCG 
TCCGGGTCTGCTGGGTGCTTCCGTTCTG 

24B CGGTAACGTCAACTTTAACGAAGTACAGTTCCGGCGGCGGGTCCTGAGCACGAACACGCAG 

AACGAAGGTACGCCAAGCACGGTGGATGT 
24T GGTCTGGACGACATCCACCGTGCTTGGCGTACCTTCGTTCTGCGTGTTCGTGCTCAGGACCC 

GCCGCCGGAACTGTACTTCGTTAAAGTT 
25B CGTAACGACGAACGCAGTAGGTGTTCTGCGGTTTGATGATGGAAGCGATAACTTCGGTCAG 

ACGGTCCTGCGGGATGGTGTCGTACGCGC 
25T GACGTTACCGGCGCGTACGACACCATCCCGCAGGACCGTCTGACCGAAGTTATCGCTTCCAT 

CATCAAACCGCAGAACACCTACTGCGTT 
26B GACGCATGTACGGCTGCAGGTCGGTCAGGGTGGAAACGTGGGATTTGAATGCTTTACGAAC 
GTGACCGTGAGCAGCTTTCTGAACAACAG 

26T CGTCGTTACGCTGTTGTTCAGAAAGCTGCTCACGGTCACGTTCGTAAAGCATTCAAATCCCA 

CGTTTCCACCCTGACCGACCTGCAGCCG 
27B GACCGGAGGAAGCTTCGTTCAGGGAGGAGGACTGTTCGATAACAACAGCGTCACGCAGCGG 

GGAGGTTTCCTGCAGGTGAGCAACGAACT 
27T TACATGCGTCAGTTCGTTGCTCACCTGCAGGAAACCTCCCCGCTGCGTGACGCTGTTGTTAT 

CGAACAGTCCTCCTCCCTGAACGAAGCT 
28B AACCCTGCGGGATACCCTGGCACTGAACGTAGGATTTACCACGGATACGAACAGCGTGGTG 

GCACATGAAACGCAGGAAAACGTCGAACA 
28T TCCTCCGGTCTGTTCGACGTTTTCCTGCGTTTCATGTGCCACCACGCTGTTCGTATCCGTGGT 
AAATCCTACGTTCAGTGCCAGGGTATC 

29B GCAGCAGCAGACCGTCACGACGGATACCAGCGAACAGTTTGTTTTCCATGTCACCGTAGCA 

CAGGGAGCACAGCAGGGTGGACAGGATGG 
29T CCGCAGGGTTCCATCCTGTCCACCCTGCTGTGCTCCCTGTGCTACGGTGACATGGAAAACAA 

ACTGTTCGCTGGTATCCGTCGTGACGGT 
30B CGTATTCCGGAACACCACGAACCAGGGTACGCAGGAAGGTTTTAGCGTGGGTCAGGTGCGG 
AGTAACCAGCAGGAAGTCGTCAACCAGAC 

30T CTGCTGCTGCGTCTGGTTGACGACTTCCTGCTGGTTACTCCGCACCTGACCCACGCTAAAAC 

CTTCCTGCGTACCCTGGTTCGTGGTGTT 
31B GAGCCGGCATCTGAACGAAAGCGGTGCCACCCAGAGCTTCGTCTTCAACCGGGAAGTTAAC 

AACGGTTTTACGCAGGTTTACAACGCAAC 

31T CCGGAATACGGTTGCGTTGTAAACCTGCGTAAAACCGTTGTTAACTTCCCGGTTGAAGACGA 
AGCTCTGGGTGGCACCGCTTTCGTTCAG 

32B GGATGGAGGTACGAGCGTAGGAGGAGTAGTCGGACTGAACTTCCAGGGTACGGGTGTCCAG 

CAGCAGACCGCACCACGGGAACAGACCGT 
10 32T ATGCCGGCTCACGGTCTGTTCCCGTGGTGCGGTCTGCTGCTGGACACCCGTACCCTGGAAGT 

TCAGTCCGACTACTCCTCCTACGCTCGT 
33B GGGAGTGGCATTTCAGACGCAGAACACCGAACAGTTTACGACGCATGTTACGACCAGCTTT 

GAAACCACGGTTGAAGGTCAGGGAAGCAC 
33T ACCTCCATCCGTGCTTCCCTGACCTTCAACCGTGGTTTCAAAGCTGGTCGTAACATGCGTCGT 

15 AAACTGTTCGGTGTTCTGCGTCTGAAA 

34B ACGCGTGGAAACGGTAAGCCTGCAGCAGCAGGATTTTGTAGATGTTGGTGCAAACGGTCTG 

CAGGGAGTTTACCTGCAGGTCCAGGAACA 
34T TGCCACTCCCTGTTCCTGGACCTGCAGGTAAACTCCCTGCAGACCGTTTGCACCAACATCTA 

CAAAATCCTGCTGCTGCAGGCTTACCGT 
20 35B AGTAGCACAGGGAAGCGGTGTCGGAGATAACACGCAGGAAGAAGGTCGGGTTTTTCCAAA 

CCTGCTGGTGGAACGGCAGCTGCAGAACGC 
35T TTCCACGCGTGCGTTCTGCAGCTGCCGTTCCACCAGCAGGTTTGGAAAAACCCGACCTTCTT 

CCTGCGTGTTATCTCCGACACCGCTTCC 
36B GGCACAGCCACTGAACAGCTTCGGACGGCAGCGGACCAGCAGCACCTTTAGCACCCAGGGA 

25 CATACCAGCGTTTTTAGCTTTCAGGATGG 

36T CTGTGCTACTCCATCCTGAAAGCTAAAAACGCTGGTATGTCCCTGGGTGCTAAAGGTGCTGC 

TGGTCCGCTGCCGTCCGAAGCTGTTCAG 
37B ACAGCTGGGTCTGAGCGGTACGCAGGGAACCCAGCAGCGGAACGTAGGTAACACGGTGAC 

GGGTCAGTTTCAGCAGGAAAGCCTGGT 
30 37T TGGCTGTGCCACCAGGCTTTCCTGCTGAAACTGACCCGTCACCGTGTTACCTACGTTCCGCT 

GCTGGGTTCCCTGCGTACCGCTCAG 

38B ACGGCAGAGCCGGGTTAGCAGCAGCTTCCAGAGCGGTCAGGGTGGTACCCGGCAGTTTACG 

GG 
38T ACCCAGCTGTCCCGTAAACTGCCGGGTACCACCCTGACCGCTCTGGAAGCTGCTGCTAACCC 

GG 
39B GCGTGCCTCGAGGAATTCGGATCCATTAGTCCAGGATGGTTTTGAAGTCG 
39T CTCTGCCGTCCGACTTCAAAACCATCCTGGACTAATGGATCCGAATTCCTCGAGGCACGC 



40 

Table 10B 

GCACGCGGGAGCTCTAGAGTCGACCATATGCCGCGTGCTCCGCGTTGCCGTGCTGTTCGTT 
CCCTGCTGCGTTCCCACTATCGCGAAGTTCTGCCGCTGGCTACCTTCGTTCGTCGTCTGGG 
CCCGCAGGGTTGGCGTCTGGTTCAGCGTGGTGACCCGGCTGCTTTCCGTGCTCTGGTTGCT 

45 CAGTGCCTGGTTTGCGTTCCGTGGGACGCTCGTCCGCCGCCGGCTGCTCCGTCCTTCCGTC 
AGGTTTCCTGCCTGAAAGAACTGGTTGCTCGTGTTCTGCAGCGTCTGTGCGAACGTGGTGC 
TAAAAACGTTCTGGCTTTCGGTTTCGCTCTGCTGGACGGTGCTCGTGGTGGTCCGCCGGAA 
GCATTCACCACCTCCGTTCGTTCCTACCTGCCGAACACCGTTACCGACGCTCTGCGTGGTT 
CCGGTGCTTGGGGTCTGCTGCTGCGTCGTGTTGGTGACGACGTTCTGGTTCACCTGCTGGC 

50 TCGTTGCGCTCTGTTCGTTCTGGTTGCTCCGTCCTGCGCTTACCAGGTTTGTGGTCCGCCG 
CTGTACCAGCTGGGTGCTGCTACCCAGGCTCGTCCGCCGCCGCACGCTTCCGGTCCGCGTC 
GTCGTCTGGGTTGCGAACGTGCTTGGAACCACTCCGTTCGTGAAGCTGGTGTTCCGCTGGG 
TCTGCCGGCTCCGGGTGCTCGTCGTCGTGGTGGTTCCGCTTCCCGTTCCCTGCCGCTGCCG 
AAACGTCCGCGTCGTGGTGCTGCTCCGGAACCGGAACGTACCCCGGTTGGTCAGGGTTCCT 
GGGCTCACCCGGGTCGTACCCGTGGTCCGTCCGACCGTGGTTTCTGCGTTGTTTCCCCGGC 
TCGTCCGGCTGAAGAAGCTACCTCCCTGGAAGGTGCTCTGTCCGGCACCCGTCACTCCCAC 
5 CCGTCCGTTGGTCGTCAGCACCACGCTGGTCCGCCGTCCACCTCCCGTCCGCCGCGTCCGT 
GGGACACCCCGTGCCCGCCGGTTTACGCTGAAACCAAACACTTCCTGTACTCCTCCGGTGA 
CAAAGAACAGCTGCGTCCGTCCTTCCTGCTGTCCTCCCTGCGTCCGTCCCTGACCGGTGCT 
CGTCGTCTGGTTGAAACTATCTTCCTGGGTTCCCGTCCGTGGATGCCGGGCACCCCGCGTC 
GTCTGCCGCGTCTGCCGCAGCGTTACTGGCAGATGCGTCCGCTGTTCCTGGAACTGCTGGG 

10 CAACCACGCTCAGTGCCCGTATGGTGTTCTGCTGAAAACCCACTGCCCGCTGCGTGCTGCT 
GTTACCCCGGCTGCTGGTGTTTGCGCTCGTGAAAAACCGCAGGGTTCCGTTGCTGCTCCGG 
AAGAAGAAGATACCGACCCGCGTCGTCTGGTTCAGCTGCTGCGTCAGCACTCCTCCCCGTG 
GCAGGTTTACGGTTTCGTTCGTGCATGCCTGCGTCGTCTGGTTCCGCCGGGTCTGTGGGGT 
TCCCGTCACAACGAACGTCGTTTCCTGCGTAACACCAAAAAATTCATCTCCCTGGGTAAAC 

15 ACGCTAAACTGTCCCTGCAGGAACTGACCTGGAAAATGTCCGTTCGTGACTGCGCTTGGCT 
GCGTCGTTCTCCGGGTGTTGGTTGCGTTCCGGCTGCTGAACACCGTCTGCGTGAAGAAATC 
CTGGCTAAATTCCTGCACTGGCTGATGTCCGTATACGTTGTTGAACTGCTGCGTTCCTTCT 
TCTACGTTACCGAAACCACCTTCCAGAAAAACCGTCTGTTCTTCTACCGTAAATCCGTTTG 
GTCCAAACTGCAGTCCATCGGTATCCGTCAGCACCTGAAACGTGTTCAGCTGCGTGAACTG 

20 TCCGAAGCTGAAGTTCGTCAGCACCGTGAAGCTCGTCCGGCTCTGCTGACCTCCCGTCTGC 
GTTTCATCCCGAAACCGGACGGTCTGCGTCCGATCGTAAACATGGACTACGTTGTTGGTGC 
TCGTACCTTCCGTCGTGAAAAACGTGCTGAGCGTCTGACCTCCCGTGTTAAAGCTCTGTTC 
TCCGTTCTGAACTACGAACGTGCTCGTCGTCCGGGTCTGCTGGGTGCTTCCGTTCTGGGTC 
TGGACGACATCCACCGTGCTTGGCGTACCTTCGTTCTGCGTGTTCGTGCTCAGGACCCGCC 

25 GCCGGAACTGTACTTCGTTAAAGTTGACGTTACCGGCGCGTACGACACCATCCCGCAGGAC 
CGTCTGACCGAAGTTATCGCTTCCATCATCAAACCGCAGAACACCTACTGCGTTCGTCGTT 
ACGCTGTTGTTCAGAAAGCTGCTCACGGTCACGTTCGTAAAGCATTCAAATCCCACGTTTC 
CACCCTGACCGACCTGCAGCCGTACATGCGTCAGTTCGTTGCTCACCTGCAGGAAACCTCC 
CCGCTGCGTGACGCTGTTGTTATCGAACAGTCCTCCTCCCTGAACGAAGCTTCCTCCGGTC 

30 TGTTCGACGTTTTCCTGCGTTTCATGTGCCACCACGCTGTTCGTATCCGTGGTAAATCCTA 
CGTTCAGTGCCAGGGTATCCCGCAGGGTTCCATCCTGTCCACCCTGCTGTGCTCCCTGTGC 
TACGGTGACATGGAAAACAAACTGTTCGCTGGTATCCGTCGTGACGGTCTGCTGCTGCGTC 
TGGTTGACGACTTCCTGCTGGTTACTCCGCACCTGACCCACGCTAAAACCTTCCTGCGTAC 
CCTGGTTCGTGGTGTTCCGGAATACGGTTGCGTTGTAAACCTGCGTAAAACCGTTGTTAAC 

TTCCCGGTTGAAGACGAAGCTCTGGGTGGCACCGCTTTCGTTCAGATGCCGGCTCACGGTC
TGTTCCCGTGGTGCGGTCTGCTGCTGGACACCCGTACCCTGGAAGTTCAGTCCGACTACTC
CTCCTACGCTCGTACCTCCATCCGTGCTTCCCTGACCTTGAACCGTGGTTTCAAAGCTGGT 
CGTAACATGCGTCGTAAACTGTTCGGTGTTCTGCGTCTGAAATGCCACTCCCTGTTCCTGG
ACCTGCAGGTAAACTCCCTGCAGACCGTTTGCACCAACATCTACAAAATCCTGCTGCTGCA 

GGCTTACCGTTTCCACGCGTGCGTTCTGCAGCTGCCGTTCCACCAGCAGGTTTGGAAAAAC
CCGACCTTCTTCCTGCGTGTTATCTCCGACACCGCTTCCCTGTGCTACTCCATCCTGAAAG 
CTAAAAACGCTGGTATGTCCCTGGGTGCTAAAGGTGCTGCTGGTCCGCTGCCGTCCGAAGC

tgttcagtggctgtgccaccaggctttcctgctgaaactcacccgtcaccgtgttacctac
gttccgctgctgggttccctgcgtaccgctcagacccagctgtcccgtaaactgccgggta
ccaccctgaccgctctggaagctgctgctaacccggctctgccgtccgacttcaaaaccat
cctggactaatggatccgaattcctcgaggcacgc
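
A sequence listing of this kind is easily damaged in reproduction (margin line numbers, stray punctuation, line wraps). The following is a minimal sketch, not part of the original disclosure, of one way to check such a listing: it strips digits and whitespace, reports any symbol other than A, C, G, or T, and translates complete codons using the standard genetic code. The function names are illustrative assumptions, and the reading frame is assumed to start at the first base supplied.

```python
import re
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove digits and whitespace (margin numbers, line wraps) and uppercase."""
    return re.sub(r"[\d\s]", "", raw).upper()


def suspicious_characters(seq: str) -> list[tuple[int, str]]:
    """Return (index, character) for every symbol that is not A, C, G, or T."""
    return [(i, ch) for i, ch in enumerate(seq) if ch not in "ACGT"]


def translate(seq: str) -> str:
    """Translate complete codons from position 0; '*' marks a stop codon."""
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# The last coding codons of the listing, with a margin number left in on purpose.
tail = clean_sequence("45 aaaaccat\ncctggactaa")
print(tail)             # AAAACCATCCTGGACTAA
print(translate(tail))  # KTILD*
```

Any position reported by `suspicious_characters` should be resolved against the original filing rather than guessed.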

The present invention also provides transgenic animals (i.e., mammals
transgenic for a human or other TRT gene sequence) expressing an hTRT or other TRT
polynucleotide or polypeptide. In one embodiment, hTRT is secreted into the milk of a
transgenic mammal such as a transgenic bovine, goat, or rabbit. Methods for production of
such animals are found, e.g., in Heyneker et al., PCT WO 91/08216.

The hTRT proteins and complexes of the invention, including those made
using the expression systems disclosed herein supra, may be purified using a variety of
general methods known in the art in accordance with the specific methods provided by the
present invention (e.g., infra). One of skill in the art will recognize that after chemical
synthesis, biological expression, or purification, the hTRT protein may possess a
conformation different from the native conformation of naturally occurring telomerase. In some
instances, it may be helpful or even necessary to denature the polypeptide (e.g., including
reduction of disulfide or other linkages) and then to cause the polypeptide to re-fold into
the preferred conformation. Productive refolding may also require the presence of hTR (or
hTR fragments). Methods of reducing and denaturing proteins and inducing re-folding are
well known to those of skill in the art (see, e.g., Debinski et al., 1993, J. Biol. Chem.,
268:14065; Kreitman and Pastan, 1993, Bioconjug. Chem., 4:581; Buchner et al., 1992,
Anal. Biochem., 205:263; and McCaman et al., 1985, J. Biotech. 2:177). See also PCT
Publication WO 96/40868, supra.

D) COMPLEXES OF HUMAN TRT AND HUMAN TELOMERASE RNA,
TELOMERASE-ASSOCIATED PROTEINS, AND OTHER BIOMOLECULES
PRODUCED BY COEXPRESSION AND OTHER MEANS

hTRT polypeptides of the invention can associate in vivo and in vitro with
other biomolecules, including RNAs (e.g., hTR), proteins (e.g., telomerase-associated
proteins), DNA (e.g., telomeric DNA, [T2AG3]N), and nucleotides, such as
(deoxy)ribonucleotide triphosphates. These associations can be exploited to assay hTRT
presence or function, to identify or purify hTRT or telomerase-associated molecules, and to
analyze hTRT or telomerase structure or function in accordance with the methods of the
present invention.

In one embodiment, the present invention provides hTRT complexed with
(e.g., associated with or bound to) a nucleic acid, usually an RNA, for example to produce a
telomerase holoenzyme. In one embodiment, the bound RNA is capable of acting as a
template for telomerase-mediated DNA synthesis. Examples of RNAs that may be
complexed with the hTRT polypeptide include a naturally occurring host cell telomerase
RNA, a human telomerase RNA (e.g., hTR; U.S. Patent No. 5,583,016), an hTR subsequence
or domain, a synthetic RNA, or other RNAs. The RNA-hTRT protein complex (an RNP)
typically exhibits one or more telomerase activities, such as telomerase catalytic activities.
These hTRT-hTR RNPs (or other hTRT-RNA complexes) can be produced by a variety of
methods, as described infra for illustrative purposes, including in vitro reconstitution,
co-expression of hTRT and hTR (or other RNA) in vitro (i.e., in a cell free system), in vivo
reconstitution, or ex vivo reconstitution.

Thus, the present invention provides, in one embodiment, an hTRT-hTR
complex (or other hTRT-RNA complex) formed in vitro by mixing separately purified
components ("in vitro reconstitution;" see, e.g., U.S. Patent No. 5,583,016 for a description of
reconstitution; also see Autexier et al., EMBO J. 15:5928). In one embodiment the hTRT
protein is produced by recombinant expression in human or non-human cells, e.g., as
described supra, and subsequently purified using protein purification methods (e.g.,
chromatography, affinity purification). In a particular embodiment, the recombinant hTRT
protein is purified to homogeneity. The purified hTRT protein is combined with separately
purified hTR, which may be produced using an in vitro transcription system, by chemical
synthesis, or by other methods and purified using standard RNA purification techniques (see
Melton et al., 1984, Nucl. Acids Res. 12:7035; Studier et al., 1986, J. Mol. Biol. 189:113).

In an alternative embodiment, the invention provides telomerase RNPs
produced by coexpression of the hTRT polypeptide and an RNA (e.g., hTR) in vitro in a
cell-free transcription-translation system (e.g., wheat germ or rabbit reticulocyte lysate). As
shown in Example 7, in vitro co-expression of a recombinant hTRT polypeptide and hTR
results in production of telomerase catalytic activity (as measured by a TRAP assay).

Further provided by the present invention are telomerase RNPs produced by
expression of the hTRT polypeptide in a cell, e.g., a mammalian cell, in which hTR is
naturally expressed or in which hTR (or another RNA capable of forming a complex with the
hTRT protein) is introduced or expressed by recombinant means. Thus, in one embodiment,
hTRT is expressed in a telomerase negative human cell in which hTR is present (e.g., BJ or
IMR90 cells), allowing the two molecules to assemble into an RNP. In another embodiment,
hTRT is expressed in a human or non-human cell in which hTR is recombinantly expressed.
Methods for expression of hTR in a cell are found in U.S. Patent 5,583,016. Further, a clone
containing a cDNA encoding the RNA component of telomerase has been placed on deposit
as pGRN33 (ATCC 75926). Genomic sequences encoding the RNA component of human
telomerase are also on deposit in the ~15 kb SauIIIA1 to HindIII insert of lambda clone 28-1
(ATCC 75925). For expression in eukaryotic cells the hTRT sequence will typically be
operably linked to a transcription initiation sequence (RNA polymerase binding site) and
transcription terminator sequences (see, e.g., PCT Publication WO 96/01835; Feng et al.,
1995, Science 269:1236).

The present invention further provides recombinantly produced or
substantially purified hTRT polypeptides coexpressed and/or associated with so-called
"telomerase-associated proteins." Thus, the present invention provides hTRT coexpressed
with, or complexed with, other proteins (e.g., telomerase-associated proteins).
Telomerase-associated proteins are those proteins that copurify with human telomerase and/or that may
play a role in modulating telomerase function or activity, for example by participating in the
association of telomerase with telomeric DNA. Examples of telomerase-associated proteins
include (but are not limited to) the following proteins and/or their human homologs:
nucleolin (see, Srivastava et al., 1989, FEBS Letts. 250:99); EF2H (elongation factor 2
homolog; see Nomura et al., 1994, DNA Res. (Japan) 1:27, GENBANK accession #D21163);
TP1/TLP1 (Harrington et al., 1997, Science 275:973; Nakayama, 1997, Cell 88:875); the
human homologue of the Tetrahymena p95 or p95 itself (Collins et al., 1995, Cell 81:677);
TPC2 (a telomere length regulatory protein; ATCC accession number 97708); TPC3 (also a
telomere length regulatory protein; ATCC accession number 97707); DNA-binding protein B
(dbpB; Horwitz et al., 1994, J. Biol. Chem. 269:14130); Telomere Repeat Binding Factors
(TRF 1 & 2; Chang et al., 1995, Science 270:1663; Chong et al., 1997, Hum. Mol. Genet. 6:69);
EST1, 3 and 4 (Lendvay et al., 1996, Genetics 144:1399; Nugent et al., 1996, Science
274:249; Lundblad et al., 1989, Cell 57:633); and End-capping factor (Cardenas et al., 1993,
Genes Dev. 7:883).

Telomerase associated proteins can be identified on the basis of co-purification
with, or binding to, hTRT protein or the hTRT-hTR RNP. Alternatively, they can be
identified on the basis of binding to an hTRT fusion protein, e.g., a GST-hTRT fusion protein
or the like, as determined by affinity purification (see, Ausubel et al. Ch 20). A particularly
useful technique for assessing protein-protein interactions, which is applicable to identifying
hTRT-associated proteins, is the two hybrid screen method of Chien et al. (Proc. Natl. Acad.
Sci. USA 88:9578 [1991]; see also Ausubel et al., supra, at Ch. 20). This screen identifies
protein-protein interactions in vivo through reconstitution of a transcriptional activator, the
yeast Gal4 transcription protein (see, Fields and Song, 1989, Nature 340:245). The method is
based on the properties of the yeast Gal4 protein, which consists of separable domains
responsible for DNA-binding and transcriptional activation. Polynucleotides, usually
expression vectors, encoding two hybrid proteins are constructed. One polynucleotide
comprises the yeast Gal4 DNA-binding domain fused to a polypeptide sequence of a protein
to be tested for an hTRT interaction (e.g., nucleolin or EF2H). Alternatively the yeast Gal4
DNA-binding domain is fused to cDNAs from a human cell, thus creating a library of human
proteins fused to the Gal4 DNA binding domain for screening for telomerase associated
proteins. The other polynucleotide comprises the Gal4 activation domain fused to an hTRT
polypeptide sequence. The constructs are introduced into a yeast host cell. Upon expression,
intermolecular binding between hTRT and the test protein can reconstitute the Gal4
DNA-binding domain with the Gal4 activation domain. This leads to the transcriptional activation
of a reporter gene (e.g., lacZ, HIS3) operably linked to a Gal4 binding site. By selecting for,
or by assaying, the reporter gene, colonies of cells that contain an hTRT interacting protein or
telomerase associated protein can be identified. Those of skill will appreciate that there are
numerous variations of the 2-hybrid screen, e.g., the LexA system (Bartel et al., 1993, in
Cellular Interactions in Development: A Practical Approach Ed. Hartley, D.A. (Oxford Univ.
Press) pp. 153-79).
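
The logic of the two hybrid screen described above can be summarized as a simple truth condition: the reporter is transcribed only when one hybrid carries the Gal4 DNA-binding domain, the other carries the Gal4 activation domain, and the two fused polypeptides bind each other. The following toy model is an illustrative sketch only; the class name, function name, and the interaction set are hypothetical and do not represent experimental data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Fusion:
    """A hybrid protein: a Gal4 domain fused to a partner polypeptide."""
    gal4_domain: str  # "DBD" (DNA-binding domain) or "AD" (activation domain)
    partner: str      # name of the fused polypeptide


def reporter_active(a: Fusion, b: Fusion,
                    interactions: set[frozenset[str]]) -> bool:
    """True only if both Gal4 domains are present and the partners bind."""
    has_both_domains = {a.gal4_domain, b.gal4_domain} == {"DBD", "AD"}
    partners_bind = frozenset({a.partner, b.partner}) in interactions
    return has_both_domains and partners_bind


# Hypothetical interaction set, for illustration only.
known = {frozenset({"hTRT", "candidate_X"})}

htrt_hybrid = Fusion("AD", "hTRT")
print(reporter_active(Fusion("DBD", "candidate_X"), htrt_hybrid, known))  # True
print(reporter_active(Fusion("DBD", "candidate_Y"), htrt_hybrid, known))  # False
print(reporter_active(Fusion("AD", "candidate_X"), htrt_hybrid, known))   # False
```

The third case shows why both domains are needed: binding alone does not reconstitute the activator if neither hybrid can reach the Gal4 binding site.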

Another useful method for identifying telomerase-associated proteins is a
three-hybrid system (see, e.g., Zhang et al., 1996, Anal. Biochem. 242:68; Licitra et al., 1996,
Proc. Natl. Acad. Sci. USA 93:12817). The telomerase RNA component can be utilized in
this system with the TRT or hTRT protein and a test protein. Another useful method for
identifying interacting proteins, particularly proteins that heterodimerize or form higher
order heteromultimers, is the E. coli/BCCP interactive screening system (see, Germino et al.
(1993) Proc. Natl. Acad. Sci. U.S.A. 90:933; Guarente (1993) Proc. Natl. Acad. Sci. (U.S.A.)
90:1639).

The present invention also provides complexes of telomere binding proteins
(which may or may not be telomerase associated proteins) and hTRT (which may or may not
be complexed with hTR, other RNAs, or one or more telomerase associated proteins).
Examples of telomere binding proteins include TRF1 and TRF2 (supra); rnpA1, rnpA2,
RAP1 (Buchman et al., 1988, Mol. Cell. Biol. 8:210; Buchman et al., 1988, Mol. Cell. Biol.
8:5086), SIR3 and SIR4 (Aparicio et al., 1991, Cell 66:1279), TEL1 (Greenwell et al., 1995,
Cell 82:823; Morrow et al., 1995, Cell 82:831); ATM (Savitsky et al., 1995, Science
268:1749), end-capping factor (Cardenas et al., 1993, Genes Dev. 7:883), and corresponding
human homologs. The aforementioned complexes may be produced generally as described
supra for complexes of hTRT and hTR or telomerase associated proteins, e.g., by mixing or
co-expression in vitro or in vivo.

V. ANTIBODIES AND OTHER BINDING AGENTS 

In a related aspect, the present invention provides antibodies that are
specifically immunoreactive with hTRT, including polyclonal and monoclonal antibodies,
antibody fragments, single chain antibodies, human and chimeric antibodies, including
antibodies or antibody fragments fused to phage coat or cell surface proteins, and others
known in the art and described herein. The antibodies of the invention can specifically
recognize and bind polypeptides that have an amino acid sequence that is substantially
identical to the amino acid sequence set forth in Figure 17 (SEQUENCE ID NO: 2), or an
immunogenic fragment thereof or epitope on the protein defined thereby. The antibodies of
the invention can exhibit a specific binding affinity for hTRT of at least about 10⁷, 10⁸, 10⁹, or
10¹⁰ M⁻¹, and may be polyclonal, monoclonal, recombinant or otherwise produced. The
invention also provides anti-hTRT antibodies that recognize an hTRT conformational epitope
(e.g., an epitope on the surface of the hTRT protein or a telomerase RNP). Likely
conformational epitopes can be identified, if desired, by computer-assisted analysis of the
hTRT protein sequence, comparison to the conformation of related reverse transcriptases,
such as the p66 subunit of HIV-1 (see, e.g., Figure 3), or empirically. Anti-hTRT antibodies
that recognize conformational epitopes have utility, inter alia, in detection and purification of
human telomerase and in the diagnosis and treatment of human disease.
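
For orientation, the binding affinities recited above are association constants (K_a, in M⁻¹); the corresponding dissociation constant is the reciprocal, K_d = 1/K_a. The short sketch below, which assumes simple 1:1 equilibrium binding and uses illustrative function names, converts the recited values and estimates the fraction of antigen bound at a given free antibody concentration.

```python
def dissociation_constant(ka_per_molar: float) -> float:
    """K_d (molar) is the reciprocal of the association constant K_a (M^-1)."""
    return 1.0 / ka_per_molar


def fraction_bound(ka_per_molar: float, free_antibody_molar: float) -> float:
    """Fraction of antigen bound at equilibrium, assuming simple 1:1 binding."""
    x = ka_per_molar * free_antibody_molar
    return x / (1.0 + x)


# Convert each recited affinity to a dissociation constant in nanomolar.
for ka in (1e7, 1e8, 1e9, 1e10):
    kd_nm = dissociation_constant(ka) * 1e9
    print(f"K_a = {ka:.0e} M^-1  ->  K_d = {kd_nm:g} nM")
```

Under this assumption, affinities of 10⁷ to 10¹⁰ M⁻¹ correspond to dissociation constants of 100 nM down to 0.1 nM, and half of the antigen is bound when the free antibody concentration equals K_d.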

For the production of anti-hTRT antibodies, hosts such as goats, sheep, cows,
guinea pigs, rabbits, rats, or mice, may be immunized by injection with hTRT protein or any
portion, fragment or oligopeptide thereof which retains immunogenic properties. In selecting
hTRT polypeptides for antibody induction, one need not retain biological activity; however,
the protein fragment or oligopeptide must be immunogenic, and preferably antigenic.
Immunogenicity can be determined by injecting a polypeptide and adjuvant into an animal
(e.g., a rabbit) and assaying for the appearance of antibodies directed against the injected
polypeptide (see, e.g., Harlow and Lane, Antibodies: A Laboratory Manual, Cold
Spring Harbor Laboratory, New York (1988), which is incorporated in its entirety and
for all purposes, e.g., at Chapter 5). Peptides used to induce specific antibodies typically have
an amino acid sequence consisting of at least five amino acids, preferably at least 8 amino
acids, more preferably at least 10 amino acids. Usually they will mimic or have substantial
sequence identity to all or a contiguous portion of the amino acid sequence of the protein of
SEQUENCE ID NO: 2. Short stretches of hTRT protein amino acids may be fused with
those of another protein, such as keyhole limpet hemocyanin, and an anti-hTRT antibody
produced against the chimeric molecule. Depending on the host species, various adjuvants
may be used to increase immunological response.
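
By way of illustration only, candidate peptides that are contiguous portions of a protein sequence can be enumerated with a sliding window. The sketch below uses a hypothetical 14-residue stretch rather than SEQUENCE ID NO: 2, and the function name and parameters are assumptions; it does not predict immunogenicity, which must be determined empirically as described above.

```python
def candidate_peptides(protein: str, length: int = 10,
                       minimum: int = 5) -> list[tuple[int, str]]:
    """List every contiguous peptide of `length` residues with its 1-based start."""
    if length < minimum:
        raise ValueError(f"peptides shorter than {minimum} residues are not considered")
    return [(i + 1, protein[i:i + length])
            for i in range(len(protein) - length + 1)]


# Hypothetical 14-residue stretch, used only to show the windowing.
example = "MKTAYIAKQRQISF"
for start, peptide in candidate_peptides(example, length=10):
    print(start, peptide)
```

For a protein of n residues and a peptide length k, the window yields n - k + 1 candidates, here five.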

The antigen is presented to the immune system in a fashion determined by
methods appropriate for the animal. These and other parameters are generally well known to
immunologists. Typically, injections are given in the footpads, intramuscularly,
intradermally, perilymph nodally or intraperitoneally. The immunoglobulins produced by the
host can be precipitated, isolated and purified by routine methods, including affinity
purification.

Illustrative examples of immunogenic hTRT peptides are provided in
Example 8. In addition, Example 8 describes the production and use of anti-hTRT polyclonal
antibodies.

A) MONOCLONAL ANTIBODIES

Monoclonal antibodies to hTRT proteins and peptides may be prepared in
accordance with the methods of the invention using any technique which provides for the
production of antibody molecules by continuous cell lines in culture. These include, but are
not limited to, the hybridoma technique originally described by Koehler and Milstein (Nature
256:495 [1975]), the human B-cell hybridoma technique (Kosbor et al., 1983, Immunol.
Today 4:72; Cote et al., 1983, Proc. Natl. Acad. Sci. USA, 80:2026), and the EBV-hybridoma
technique (Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R Liss Inc,
New York NY, pp 77-96 [1985]).

In one embodiment, appropriate animals are selected and the appropriate
immunization protocol followed. The production of non-human monoclonal antibodies, e.g.,
murine, lagomorpha, equine, is well known and can be accomplished by, for example,
immunizing an animal with a preparation containing hTRT or fragments thereof. In one
method, after the appropriate period of time, the spleens of the animals are excised and
individual spleen cells are fused, typically, to immortalized myeloma cells under appropriate
selection conditions. Thereafter, the cells are clonally separated and the supernatants of each
clone (e.g., hybridoma) are tested for the production of an appropriate antibody specific for
the desired region of the antigen. Techniques for producing antibodies are well known in the
art. See, e.g., Goding et al., Monoclonal Antibodies: Principles and Practice (2d ed.)
Acad. Press, N.Y., and Harlow and Lane, supra, each of which is incorporated in its entirety
and for all purposes. Other suitable techniques involve the in vitro exposure of lymphocytes
to the antigenic polypeptides or, alternatively, the selection of libraries of antibodies in phage
or similar vectors (see, infra).

B) HUMAN ANTIBODIES 

In another aspect of the invention, human antibodies against an hTRT
polypeptide are provided. Human monoclonal antibodies against a known antigen can also be
made using transgenic animals having elements of a human immune system (see, e.g., U.S.
Patent Nos. 5,569,825 and 5,545,806, both of which are incorporated by reference in their
entirety for all purposes) or using human peripheral blood cells (Casali et al., 1986, Science
234:476). Some human antibodies are selected by competitive binding experiments, or
otherwise, to have the same epitope specificity as a particular mouse antibody.

In an alternative embodiment, human antibodies to an hTRT polypeptide can
be produced by screening a DNA library from human B cells according to the general
protocol outlined by Huse et al., 1989, Science 246:1275, which is incorporated by reference.
Antibodies binding to the hTRT polypeptide are selected. Sequences encoding such
antibodies (or binding fragments) are then cloned and amplified. The protocol described by
Huse is often used with phage-display technology.

C) HUMANIZED OR CHIMERIC ANTIBODIES 

The invention also provides anti-hTRT antibodies that are made chimeric,
human-like or humanized, to reduce their potential antigenicity, without reducing their
affinity for their target. Preparation of chimeric, human-like and humanized antibodies has
been described in the art (see, e.g., U.S. Patent Nos. 5,585,089 and 5,530,101; Queen, et al.,
1989, Proc. Nat'l Acad. Sci. USA 86:10029; and Verhoeyan et al., 1988, Science 239:1534;
each of which is incorporated by reference in its entirety and for all purposes). Humanized
immunoglobulins have variable framework regions substantially from a human
immunoglobulin (termed an acceptor immunoglobulin) and complementarity determining
regions substantially from a non-human (e.g., mouse) immunoglobulin (referred to as the
donor immunoglobulin). The constant region(s), if present, are also substantially from a
human immunoglobulin.

In some applications, such as administration to human patients, the humanized
(as well as human) anti-hTRT antibodies of the present invention offer several advantages
over antibodies from murine or other species: (1) the human immune system should not
recognize the framework or constant region of the humanized antibody as foreign, and
therefore the antibody response against such an injected antibody should be less than against
a totally foreign mouse antibody or a partially foreign chimeric antibody; (2) because the
effector portion of the humanized antibody is human, it may interact better with other parts of
the human immune system; and (3) injected humanized antibodies have a half-life essentially
equivalent to naturally occurring human antibodies, allowing smaller and less frequent doses
than antibodies of other species. As implicit from the foregoing, anti-hTRT antibodies have
application in the treatment of disease, i.e., to target telomerase-positive cells.

D) PHAGE DISPLAY

The present invention also provides anti-hTRT antibodies (or binding
compositions) produced by phage display methods (see, e.g., Dower et al., WO 91/17271;
McCafferty et al., WO 92/01047; and Vaughan et al., 1996, Nature Biotechnology, 14:309;
each of which is incorporated by reference in its entirety for all purposes). In these methods,
libraries of phage are produced in which members display different antibodies on their outer
surfaces. Antibodies are usually displayed as Fv or Fab fragments. Phage displaying
antibodies with a desired specificity are selected by affinity enrichment to an hTRT
polypeptide.

In a variation of the phage-display method, humanized antibodies having the
binding specificity of a selected murine antibody can be produced. In this method, either the
heavy or light chain variable region of the selected murine antibody is used as a starting
material. If, for example, a light chain variable region is selected as the starting material, a
phage library is constructed in which members display the same light chain variable region
(i.e., the murine starting material) and a different heavy chain variable region. The heavy
chain variable regions are obtained from a library of rearranged human heavy chain variable
regions. A phage showing strong specific binding for the hTRT polypeptide (e.g., at least 10⁸
and preferably at least 10⁹ M⁻¹) is selected. The human heavy chain variable region from this
phage then serves as a starting material for constructing a further phage library. In this
library, each phage displays the same heavy chain variable region (i.e., the region identified
from the first display library) and a different light chain variable region. The light chain
variable regions are obtained from a library of rearranged human variable light chain regions.
Again, phage showing strong specific binding are selected. These phage display the variable
regions of completely human anti-hTRT antibodies. These antibodies usually have the same
or similar epitope specificity as the murine starting material.
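
The two-round selection just described can be outlined schematically as follows. This is a sketch under stated assumptions, not a laboratory protocol: the `affinity` callable is a hypothetical stand-in for a measured binding constant (K_a, in M⁻¹), the chain names and affinity values are made up, and each round simply keeps the best binder that meets the threshold.

```python
from typing import Callable

# Assumption: a scoring callable stands in for the binding measurement.
Affinity = Callable[[str, str], float]  # (heavy_chain, light_chain) -> K_a


def guided_selection(murine_light: str, human_heavy: list[str],
                     human_light: list[str], affinity: Affinity,
                     threshold: float = 1e8) -> tuple[str, str]:
    """Two rounds of chain replacement, starting from a murine light chain."""
    # Round 1: fixed murine light chain, library of human heavy chains.
    best_heavy = max(human_heavy, key=lambda h: affinity(h, murine_light))
    if affinity(best_heavy, murine_light) < threshold:
        raise ValueError("no heavy chain met the affinity threshold")
    # Round 2: fixed selected human heavy chain, library of human light chains.
    best_light = max(human_light, key=lambda lt: affinity(best_heavy, lt))
    if affinity(best_heavy, best_light) < threshold:
        raise ValueError("no light chain met the affinity threshold")
    return best_heavy, best_light


# Made-up affinities, purely to exercise the function.
table = {("H1", "mL"): 5e7, ("H2", "mL"): 3e8,
         ("H2", "L1"): 2e8, ("H2", "L2"): 1e9}
pair = guided_selection("mL", ["H1", "H2"], ["L1", "L2"],
                        lambda h, lt: table.get((h, lt), 0.0))
print(pair)  # ('H2', 'L2')
```

Because the murine chain is held fixed in the first round, the selected human heavy chain is biased toward the original epitope, which is why the final fully human pair usually retains the same or similar specificity.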

E) HYBRID ANTIBODIES 

The invention also provides hybrid antibodies that share the specificity of
antibodies against an hTRT polypeptide but are also capable of specific binding to a second
moiety. In such hybrid antibodies, one heavy and light chain pair is usually from an
anti-hTRT antibody and the other pair from an antibody raised against another epitope or protein.
This results in the property of multi-functional valency, i.e., ability to bind at least two
different epitopes simultaneously, where at least one epitope is the epitope to which the
anti-complex antibody binds. Such hybrids can be formed by fusion of hybridomas producing the
respective component antibodies, or by recombinant techniques. Such hybrids can be used to
carry a compound (i.e., drug) to a telomerase-positive cell (i.e., a cytotoxic agent is delivered
to a cancer cell).

Immunoglobulins of the present invention can also be fused to functional 
regions from other genes (e.g., enzymes) to produce fusion proteins (e.g., immunotoxins) 
having useful properties. 


F) ANTI-IDIOTYPIC ANTIBODIES 

Also useful are anti-idiotype antibodies which can be isolated by the above
procedures. Anti-idiotypic antibodies may be prepared by, for example, immunization of an
animal with the primary antibody (i.e., anti-hTRT antibodies or hTRT-binding fragments
thereof). For anti-hTRT antibodies, anti-idiotype antibodies whose binding to the primary
antibody is inhibited by an hTRT polypeptide or fragments thereof are selected. Because
both the anti-idiotypic antibody and the hTRT polypeptide or fragments thereof bind the
primary immunoglobulin, the anti-idiotypic immunoglobulin can represent the "internal
image" of an epitope and thus can substitute for the hTRT polypeptide in assays or can be
used to bind (i.e., inactivate) anti-hTRT antibodies, e.g., in a patient. Anti-idiotype antibodies
can also interact with telomerase associated proteins. Administration of such antibodies can
affect telomerase function by titrating out or competing with hTRT in binding to
hTRT-associated proteins.

G) GENERAL

The antibodies of the invention may be of any isotype, e.g., IgM, IgD, IgG, 
IgA, and IgE, with IgG, IgA and IgM often preferred. Humanized antibodies may comprise 
sequences from more than one class or isotype. 

In another embodiment of the invention, fragments of the intact antibodies
described above are provided. Typically, these fragments can compete with the intact
antibody from which they were derived for specific binding to the hTRT polypeptide, and
bind with an affinity of at least 10⁷, 10⁸, 10⁹, or 10¹⁰ M⁻¹. Antibody fragments include
separate heavy chains, light chains, Fab, Fab', F(ab')2, Fabc, and Fv. Fragments can be
produced by enzymatic or chemical separation of intact immunoglobulins. For example, a
F(ab')2 fragment can be obtained from an IgG molecule by proteolytic digestion with pepsin
at pH 3.0-3.5 using standard methods such as those described in Harlow and Lane, supra.
Fab fragments may be obtained from F(ab')2 fragments by limited reduction, or from whole
antibody by digestion with papain in the presence of reducing agents (see generally, Paul, W.,
ed., Fundamental Immunology, 2nd ed., Raven Press, N.Y., 1989, Ch. 7, incorporated by
reference in its entirety for all purposes). Fragments can also be produced by recombinant
DNA techniques. Segments of nucleic acids encoding selected fragments are produced by
digestion of full-length coding sequences with restriction enzymes, or by de novo synthesis.
Often fragments are expressed in the form of phage-coat fusion proteins.

Many of the immunoglobulins described above can undergo non-critical
amino-acid substitutions, additions or deletions in both the variable and constant regions
without loss of binding specificity or effector functions, or intolerable reduction of binding
affinity (i.e., below about 10⁷ M⁻¹). Usually, immunoglobulins incorporating such alterations
exhibit substantial sequence identity to a reference immunoglobulin from which they were
derived. A mutated immunoglobulin can be selected having the same specificity and
increased affinity compared with a reference immunoglobulin from which it was derived.
Phage-display technology offers useful techniques for selecting such immunoglobulins. See,
e.g., Dower et al., WO 91/17271; McCafferty et al., WO 92/01047; and Huse, WO 92/06204.

The antibodies of the present invention can be used with or without
modification. Frequently, the antibodies will be labeled by joining, either covalently or
non-covalently, a detectable label. As labeled binding entities, the antibodies of the invention are
particularly useful in diagnostic applications.

The anti-hTRT antibodies of the invention can be purified using well known
methods. The whole antibodies, their dimers, individual light and heavy chains, or other
immunoglobulin forms of the present invention can be purified using the methods and
reagents of the present invention in accordance with standard procedures of the art, including
ammonium sulfate precipitation, affinity columns, column chromatography, gel
electrophoresis and the like (see generally Scopes, Protein Purification: Principles and
Practice 3rd Edition (Springer-Verlag, N.Y., 1994)). Substantially pure immunoglobulins
of at least about 90 to 95%, or even 98 to 99% or more homogeneity are preferred.

VI. PURIFICATION OF HUMAN TELOMERASE 

The present invention provides isolated human telomerase of unprecedented
purity. In particular, the present invention provides: purified hTRT of recombinant or
nonrecombinant origin; purified hTRT-hTR complexes (i.e., RNPs) of recombinant,
nonrecombinant, or mixed origin, optionally comprising one or more telomerase-associated
proteins; purified naturally occurring human telomerase; and the like. Moreover, the
invention provides methods and reagents for partially, substantially or highly purifying the
above molecules and complexes, including variants, fusion proteins, naturally occurring
proteins, and the like (collectively referred to as "hTRT and/or hTRT complexes").

Prior to the present disclosure, attempts to purify the
telomerase enzyme complex to homogeneity had met with limited success. The methods
provided in the aforelisted applications provide purification of telomerase by
up to approximately 60,000-fold or more compared to crude cell extracts. The present invention provides
hTRT and hTRT complexes of even greater purity, in part by virtue of the novel
immunoaffinity reagents (e.g., anti-hTRT antibodies) of the present invention, and/or the
reagents, cells, and methods provided herein for recombinant expression of hTRT.
Recombinant expression of hTRT and hTRT complexes facilitates purification because the
desired molecules can be produced at much higher levels than found in most expressing cells
occurring in nature, and/or because the recombinant hTRT molecule can be modified (e.g., by
fusion with an epitope tag) such that it may be easily purified.
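
As a point of reference, fold purification is conventionally computed as the ratio of the specific activity (activity per unit mass of protein) of the purified preparation to that of the crude extract. The following sketch illustrates the arithmetic with hypothetical numbers chosen only so that the result is about 60,000-fold; none of the values are measurements from this disclosure, and the function names are assumptions.

```python
def specific_activity(activity_units: float, protein_mg: float) -> float:
    """Activity per milligram of total protein."""
    return activity_units / protein_mg


def fold_purification(crude_specific: float, purified_specific: float) -> float:
    """Ratio of purified to crude specific activity."""
    return purified_specific / crude_specific


def percent_yield(crude_units: float, purified_units: float) -> float:
    """Percentage of the starting activity recovered."""
    return 100.0 * purified_units / crude_units


# Hypothetical values: 1000 units in 500 mg crude; 120 units in 0.001 mg purified.
crude = specific_activity(1000.0, 500.0)
purified = specific_activity(120.0, 0.001)

print(f"fold purification: {fold_purification(crude, purified):,.0f}")
print(f"yield: {percent_yield(1000.0, 120.0):.0f}%")
```

The example also shows that a high fold purification can coincide with a modest yield, which is one reason high-level recombinant expression simplifies purification.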

It will be recognized that naturally occurring telomerase can be purified from
any telomerase-positive cell, and recombinant hTRT and hTRT complexes can be expressed
and purified, inter alia, using any of the in vitro, in vivo, ex vivo, or plant or animal
expression systems disclosed supra, or other systems known in the art.

In one embodiment, the hTRT, telomerase and other compositions of the
invention are purified using an immunoaffinity step, alone or in combination with other
purification steps. Typically, an immobilized or immobilizable anti-hTRT antibody, as
provided by the present invention, is contacted with a sample, such as a cell lysate, that
contains the desired hTRT or hTRT-containing complex under conditions in which
anti-hTRT antibody binds the hTRT antigen. After removal of the unbound components of the
sample by methods well known in the art, the hTRT composition may be eluted, if desired,
from the antibody, in substantially pure form. In one embodiment, immunoaffinity
chromatography methods well known in the art are used (see, e.g., Harlow and Lane, supra;
and Ausubel, supra; Hermansan et al., 1992, Immobilized Affinity Ligand Techniques
(Academic Press, San Diego)) in accordance with the methods of the invention. In another
illustrative embodiment, immunoprecipitation of anti-hTRT-immunoglobulin-hTRT
complexes is carried out using immobilized Protein A. Numerous variations and alternative
immunoaffinity purification protocols suitable for use in accordance with the methods and
reagents of the invention are well-known to those of skill.

In another embodiment, recombinant hTRT proteins can, as a consequence of
their high level of expression, be purified using routine protein purification methods, such as
ammonium sulfate precipitation, affinity columns (e.g., immunoaffinity), size-exclusion,
anion and cation exchange chromatography, gel electrophoresis and the like (see, generally,
R. Scopes, Protein Purification, Springer-Verlag, N.Y. (1982) and Deutscher, Methods
in Enzymology Vol. 182: Guide to Protein Purification, Academic Press, Inc. N.Y.
(1990)) instead of, or in addition to, immunoaffinity methods. Cation exchange methods can
be particularly useful due to the basic pI of the hTRT protein. For example, immobilized
phosphate may be used as a cation exchange functional group (e.g., P-11 Phosphocellulose,
Whatman catalog #4071 or Cellulose Phosphate, Sigma catalog #C 3145). Immobilized
phosphate has two advantageous features for hTRT purification - it is a cation exchange resin,
and it shows physical resemblance to the phosphate backbone of nucleic acid. This can allow
for affinity chromatography because hTRT binds hTR and telomeric DNA. Other
non-specific and specific nucleic acid affinity chromatography methods are also useful for
purification (e.g., Alberts et al., 1971, Methods Enzymol. 21:198; Arndt-Jovin et al., 1975,
Eur. J. Biochem. 54:411; Pharmacia catalog #27-5575-02). Further exploitation of this
binding function of hTRT could include the use of specific nucleic acid (e.g., telomerase
primer or hTR) affinity chromatography for purification (Chodosh et al., 1986, Mol. Cell.
Biol. 6:4723; Wu et al., 1987, Science 238:1247; Kadonaga, 1991, Methods Enzymol.
208:10). Immobilized Cibacron Blue dye, which shows physical resemblance to nucleotides,
is another useful resin for hTRT purification (Pharmacia catalog #17-0948-01 or Sigma
catalog #C 1285), due to hTRT binding of nucleotides (e.g., as substrates for DNA synthesis).

In one embodiment, hTRT proteins are isolated directly from an in vitro or in
vivo expression system in which other telomerase components are not coexpressed. It will be
recognized that isolated hTRT protein may also be readily obtained from purified human
telomerase or hTRT complexes, for example, by disrupting the telomerase RNP (e.g., by
exposure to a mild or other denaturant) and separating the RNP components (e.g., by routine
means such as chromatography or immunoaffinity chromatography).

Telomerase purification may be monitored using a telomerase activity assay
(e.g., the TRAP assay, conventional assay, or primer-binding assay), by measuring the
enrichment of hTRT (e.g., by ELISA), by measuring the enrichment of hTR, or other
methods known in the art.
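
Monitoring of this kind is commonly summarized as a purification table. The following is a minimal sketch, assuming that total activity (e.g., from a TRAP assay, in arbitrary units) and total protein have been measured for each fraction. The fraction names and all numbers are hypothetical and purely illustrative; they are not data from this disclosure.

```python
from dataclasses import dataclass


@dataclass
class Fraction:
    name: str
    total_activity: float    # arbitrary activity units (hypothetical values)
    total_protein_mg: float  # total protein in the fraction, in mg


def purification_table(fractions):
    """Return (name, specific activity, fold purification, percent yield)
    for each fraction, relative to the first fraction (the crude extract)."""
    crude = fractions[0]
    crude_specific = crude.total_activity / crude.total_protein_mg
    rows = []
    for f in fractions:
        specific = f.total_activity / f.total_protein_mg
        fold = specific / crude_specific
        percent_yield = 100.0 * f.total_activity / crude.total_activity
        rows.append((f.name, specific, fold, percent_yield))
    return rows


# Hypothetical three-step scheme, for illustration only.
steps = [
    Fraction("crude extract", 1000.0, 1000.0),
    Fraction("cation exchange", 800.0, 40.0),
    Fraction("immunoaffinity", 500.0, 0.5),
]

for name, specific, fold, percent_yield in purification_table(steps):
    print(f"{name:16s} specific={specific:8.1f} fold={fold:8.1f} yield={percent_yield:5.1f}%")
```

Fold purification here is the ratio of specific activities, so it rises when contaminating protein is removed faster than activity is lost; yield tracks how much total activity survives each step.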

The purified human telomerase, hTRT proteins, and hTRT complexes
provided by the present invention are, in one embodiment, highly purified (i.e., at least about
90% homogeneous, more often at least about 95% homogeneous). Homogeneity can be
determined by standard means such as SDS-polyacrylamide gel electrophoresis and other
means known in the art (see, e.g., Ausubel et al., supra). It will be understood that, although
highly purified human telomerase, hTRT protein, or hTRT complexes are sometimes desired,
substantially purified (e.g., at least about 75% homogeneous) or partially purified (e.g., at
least about 20% homogeneous) human telomerase, hTRT protein, or hTRT complexes are
useful in many applications, and are also provided by the present invention. For example,
partially purified telomerase is useful for screening test compounds for telomerase
modulatory activity, and other uses (see, infra and supra; see U.S. Patent No. 5,645,986).

VII. TREATMENT OF TELOMERASE-RELATED DISEASE
A) INTRODUCTION 

The present invention provides hTRT polynucleotides, polypeptides, and
antibodies useful for the treatment of human diseases and disease conditions. The
recombinant and synthetic hTRT gene products (protein and mRNA) of the invention can be
used to create or elevate telomerase activity in a cell, as well as to inhibit telomerase activity
in cells in which it is not desired. Thus, inhibiting, activating or otherwise altering a
telomerase activity (e.g., telomerase catalytic activity, fidelity, processivity, telomere binding,
etc.) in a cell can be used to change the proliferative capacity of the cell. For example,
reduction of telomerase activity in an immortal cell, such as a malignant tumor cell, can
render the cell mortal. Conversely, increasing the telomerase activity in a mortal cell (e.g.,
most human somatic cells) can increase the proliferative capacity of the cell. For example,
expression of hTRT protein in dermal fibroblasts, thereby increasing telomere length, will
result in increased fibroblast proliferative capacity; such expression can slow or reverse the
age-dependent slowing of wound closure (see, e.g., West, 1994, Arch. Derm. 130:87).

Thus, in one aspect, the present invention provides reagents and methods
useful for treating diseases and conditions characterized by the presence, absence, or amount
of human telomerase activity in a cell and that are susceptible to treatment using the
compositions and methods disclosed herein. These diseases include, as described more fully
below, cancers, other diseases of cell proliferation (particularly diseases of aging),
immunological disorders, infertility (or fertility), and others.


B) TREATMENT OF CANCER 

The present invention provides methods and compositions for reducing
telomerase activity in tumor cells and for treating cancer. Compositions include antisense
oligonucleotides, peptides, gene therapy vectors encoding antisense oligonucleotides or
activity altering proteins, and anti-hTRT antibodies. Cancer cells (e.g., malignant tumor
cells) that express telomerase activity (telomerase-positive cells) can be mortalized by
decreasing or inhibiting the endogenous telomerase activity. Moreover, because telomerase
levels correlate with disease characteristics such as metastatic potential (e.g., U.S. Patent No.
5,639,613; 5,648,215; 5,489,508; Pandita et al., 1996, Proc. Am. Ass. Cancer Res. 37:559),
any reduction in telomerase activity could reduce the aggressive nature of a cancer to a more
manageable disease state (increasing the efficacy of traditional interventions).

The invention provides compositions and methods useful for treatment of
cancers of any of a wide variety of types, including solid tumors and leukemias. Types of
cancer that may be treated include (but are not limited to): adenocarcinoma of the breast,
prostate, and colon; all forms of bronchogenic carcinoma of the lung; myeloid; melanoma;
hepatoma; neuroblastoma; papilloma; apudoma; choristoma; branchioma; malignant
carcinoid syndrome; carcinoid heart disease; carcinoma (e.g., Walker, basal cell,
basosquamous, Brown-Pearce, ductal, Ehrlich tumor, in situ, Krebs 2, Merkel cell, mucinous,
non-small cell lung, oat cell, papillary, scirrhous, bronchiolar, bronchogenic, squamous cell,
and transitional cell); histiocytic disorders; leukemia (e.g., B-cell, mixed-cell, null-cell,
T-cell, T-cell chronic, HTLV-II-associated, lymphocytic acute, lymphocytic chronic, mast-cell,
and myeloid); histiocytosis malignant; Hodgkin's disease; immunoproliferative small;
non-Hodgkin's lymphoma; plasmacytoma; reticuloendotheliosis; melanoma;
chondroblastoma; chondroma; chondrosarcoma; fibroma; fibrosarcoma; giant cell tumors;
histiocytoma; lipoma; liposarcoma; mesothelioma; myxoma; myxosarcoma; osteoma;
osteosarcoma; Ewing's sarcoma; synovioma; adenofibroma; adenolymphoma;
carcinosarcoma; chordoma; craniopharyngioma; dysgerminoma; hamartoma;
mesenchymoma; mesonephroma; myosarcoma; ameloblastoma; cementoma; odontoma;
teratoma; thymoma; trophoblastic tumor; adenocarcinoma; adenoma; cholangioma;
cholesteatoma; cylindroma; cystadenocarcinoma; cystadenoma; granulosa cell tumor;
gynandroblastoma; hepatoma; hidradenoma; islet cell tumor; Leydig cell tumor; papilloma;
Sertoli cell tumor; theca cell tumor; leiomyoma; leiomyosarcoma; myoblastoma; myoma;
myosarcoma; rhabdomyoma; rhabdomyosarcoma; ependymoma; ganglioneuroma; glioma;
medulloblastoma; meningioma; neurilemmoma; neuroblastoma; neuroepithelioma;
neurofibroma; neuroma; paraganglioma; paraganglioma nonchromaffin; angiokeratoma;
angiolymphoid hyperplasia with eosinophilia; angioma sclerosing; angiomatosis;
glomangioma; hemangioendothelioma; hemangioma; hemangiopericytoma;
hemangiosarcoma; lymphangioma; lymphangiomyoma; lymphangiosarcoma; pinealoma;
carcinosarcoma; chondrosarcoma; cystosarcoma phyllodes; fibrosarcoma; hemangiosarcoma;
leiomyosarcoma; leukosarcoma; liposarcoma; lymphangiosarcoma; myosarcoma;
myxosarcoma; ovarian carcinoma; rhabdomyosarcoma; sarcoma (e.g., Ewing's, experimental,
Kaposi's, and mast-cell); neoplasms (e.g., bone, breast, digestive system, colorectal, liver,
pancreatic, pituitary, testicular, orbital, head and neck, central nervous system, acoustic,
pelvic, respiratory tract, and urogenital); neurofibromatosis; and cervical dysplasia. The
invention provides compositions and methods useful for treatment of other conditions in
which cells have become immortalized or hyperproliferative, e.g., by dysregulation (e.g.,
abnormally high expression) of hTRT, telomerase enzyme, or telomerase activity.

The present invention further provides compositions and methods for
prevention of cancers, including anti-hTRT vaccines, gene therapy vectors that prevent
telomerase activation, and gene therapy vectors that result in specific death of
telomerase-positive cells. In a related aspect, the gene replacement therapy methods described below
may be used for "treating" a genetic predilection for cancers.

C) TREATMENT OF OTHER CONDITIONS 

The present invention also provides compositions and methods useful for
treatment of diseases and disease conditions (in addition to cancers) characterized by under-
or over-expression of telomerase or hTRT gene products. Examples include: diseases of cell
proliferation, diseases resulting from cell senescence (particularly diseases of aging),
immunological disorders, infertility, diseases of immune dysfunction, and others.



Certain diseases of aging are characterized by cell senescence-associated
changes due to reduced telomere length (compared to younger cells), resulting from the
absence (or much lower levels) of telomerase activity in the cell. Decreased telomere length
and decreased replicative capacity contribute to diseases such as those described below.
Telomerase activity and telomere length can be increased by, for example, increasing levels
of hTRT gene products (protein and mRNA) in the cell. A partial listing of conditions
associated with cellular senescence in which hTRT expression can be therapeutic includes
Alzheimer's disease, Parkinson's disease, Huntington's disease, and stroke; age-related
diseases of the integument such as dermal atrophy, elastolysis and skin wrinkling, sebaceous
gland hyperplasia, senile lentigo, graying of hair and hair loss, chronic skin ulcers, and
age-related impairment of wound healing; degenerative joint disease; osteoporosis;
age-related immune system impairment (e.g., involving cells such as B and T lymphocytes,
monocytes, neutrophils, eosinophils, basophils, NK cells and their respective progenitors);
age-related diseases of the vascular system including atherosclerosis, calcification,
thrombosis, and aneurysms; diabetes, muscle atrophy, respiratory diseases, diseases of the
liver and GI tract, metabolic diseases, endocrine diseases (e.g., disorders of the pituitary and
adrenal gland), reproductive diseases, and age-related macular degeneration. These diseases
and conditions can be treated by increasing the levels of hTRT gene products in the cell to
increase telomere length, thereby restoring or imparting greater replicative capacity to the
cell. Such methods can be carried out on cells cultured ex vivo or cells in vivo. In one
embodiment, the cells are first treated to activate telomerase and lengthen telomeres, and then
treated to inactivate the hTRT gene and telomerase activity. In a preferred embodiment,
telomerase activity is generated by a vector of the invention in an embryonic germ or stem
cell prior to or during differentiation.

The present invention also provides methods and compositions useful for
treating infertility. Human germline cells (e.g., spermatogonia cells, their progenitors or
descendants) are capable of indefinite proliferation and characterized by high telomerase
activity. Abnormal or diminished levels of hTRT gene products can result, for example, in
inadequate or abnormal production of spermatozoa, leading to infertility or disorders of
reproduction. Accordingly, "telomerase-based" infertility can be treated using the methods
and compositions described herein to increase telomerase levels. Similarly, because
inhibition of telomerase may negatively impact spermatogenesis, oogenesis, and sperm and
egg viability, the telomerase inhibitory compositions of the invention can have contraceptive
effects when used to reduce hTRT gene product levels in germline cells.

Further, the invention provides methods and compositions useful for decreasing
the proliferative potential of telomerase-positive cells such as activated lymphocytes and
hematopoietic stem cells by reducing telomerase activity. Thus, the invention provides means
for effecting immunosuppression. Conversely, the methods and reagents of the invention are
useful for increasing telomerase activity and proliferative potential in cells, such as stem
cells, that express a low level of telomerase or no telomerase prior to therapeutic intervention.

D) MODES OF INTERVENTION 

As is clear from the foregoing discussion, modulation of the level of
telomerase or telomerase activity of a cell can have a profound effect on the proliferative
potential of the cell, and so has great utility in treatment of disease. As is also clear, this
modulation may be either a decrease in telomerase activity or an increase in activity. The
telomerase modulatory molecules of the invention can act through a number of mechanisms;
some of these are described in this and the following subsections to aid the practitioner in
selecting therapeutic agents. However, applicants do not intend to be limited to any particular
mechanism of action for the novel therapeutic compounds, compositions and methods
described herein.

Telomerase activity may be decreased through any of several mechanisms or
combinations of mechanisms. One mechanism is the reduction of hTRT gene expression to
reduce telomerase activity. This reduction can be at the level of transcription of the hTRT
gene into mRNA, processing (e.g., splicing), nuclear transport or stability of mRNA,
translation of mRNA to produce hTRT protein, or stability and function of hTRT protein.
Another mechanism is interference with one or more activities of telomerase (e.g., the reverse
transcriptase catalytic activity, or the hTR-binding activity) using inhibitory nucleic acids,
polypeptides, or other agents (e.g., mimetics, small molecules, drugs and pro-drugs) that can
be identified using the methods, or are provided by compositions, disclosed herein. Other
mechanisms include sequestration of hTR and/or telomerase associated proteins, and
interference with the assembly of the telomerase RNP from its component subunits. In a
related mechanism, an hTRT promoter sequence is operably linked to a gene encoding a toxin
and introduced into a cell; if or when hTRT transcriptional activators are expressed or
activated in the cell, the toxin will be expressed, resulting in specific cell killing.

A related method for reducing the proliferative capacity of a cell involves
introducing an hTRT variant with low fidelity (i.e., one with a high, e.g., greater than 1%,
error rate) such that aberrant telomeric repeats are synthesized. These aberrant repeats affect
telomere protein binding and lead to chromosomal rearrangements and aberrations and/or
lead to cell death.

Similarly, telomerase activity may be increased through any of several
mechanisms, or a combination of mechanisms. These include increasing the amount of hTRT
in a cell. Usually this is carried out by introducing an hTRT polypeptide-encoding
polynucleotide into the cell (e.g., a recombinantly produced polynucleotide comprising an hTRT
DNA sequence operably linked to a promoter, or a stable hTRT mRNA). Alternatively, a
catalytically active hTRT polypeptide can itself be introduced into a cell or tissue, e.g., by
microinjection or other means known in the art. In other mechanisms, expression from the
endogenous hTRT gene or the stability of hTRT gene products in the cell can be increased.
Telomerase activity in a cell can also be increased by interfering with the interaction of
endogenous telomerase inhibitors and the telomerase RNP, or endogenous hTRT
transcription repressors and the hTRT gene; by increasing expression or activity of hTRT
transcription activators; and other means apparent to those of skill upon review of this
disclosure.

E) INTERVENTION AGENTS 
1) TRT PROTEINS & PEPTIDES

In one embodiment, the invention provides telomerase modulatory
polypeptides (i.e., proteins, polypeptides, and peptides) that increase or reduce telomerase
activity and that can be introduced into a target cell directly (e.g., by injection,
liposome-mediated fusion, application of a hydrogel to the tumor [e.g., melanoma] surface, fusion or
attachment to herpes virus structural protein VP22, and other means described herein and
known in the art). In a second embodiment, telomerase modulatory proteins and peptides of
the invention are expressed in a cell by introducing a nucleic acid (e.g., a DNA expression
vector or mRNA) encoding the desired protein or peptide into the cell. Expression may be
either constitutive or inducible depending on the vector and choice of promoter (see
discussion below). Messenger RNA preparations encoding hTRT are especially useful when
only transient expression (e.g., transient activation of telomerase) is desired. Methods for
introduction and expression of nucleic acids into a cell are well known in the art (also, see
elsewhere in this specification, e.g., sections on oligonucleotides, gene therapy methods).

In one aspect of the invention, a telomerase modulatory polypeptide that
increases telomerase activity in a cell is provided. In one embodiment, the polypeptide is a
catalytically active hTRT polypeptide capable of directing the synthesis (in conjunction with
an RNA template such as hTR) of human telomeric DNA. This activity can be measured, as
discussed above, e.g., using a telomerase activity assay such as a TRAP assay. In one
embodiment, the polypeptide is a full-length hTRT protein, having a sequence of, or
substantially identical to, the sequence of 1132 residues of SEQUENCE ID No: 2. In another
embodiment, the polypeptide is a variant of the hTRT protein of SEQUENCE ID No: 2, such
as a fusion polypeptide, derivatized polypeptide, truncated polypeptide, conservatively
substituted polypeptide, activity-modified polypeptide, or the like. A fusion or derivatized
protein may include a targeting moiety that increases the ability of the polypeptide to traverse
a cell membrane or causes the polypeptide to be delivered preferentially to a specified cell type
(e.g., liver cells or tumor cells) or cell compartment (e.g., nuclear compartment).
Examples of targeting moieties include lipid tails, amino acid sequences such
as antennapedia peptide or a nuclear localization signal (NLS; e.g., Xenopus nucleoplasmin;
Robbins et al., 1991, Cell 64:615). Naturally occurring hTRT protein (e.g., having a
sequence of, or substantially identical to, SEQUENCE ID NO: 2) acts in the cell nucleus.
Thus, it is likely that one or more subsequences of SEQUENCE ID NO: 2, such as residues
193-196 (PRRR) and residues 235-240 (PKRPRR), act as a nuclear localization signal. These
small regions are likely NLSs based on the observation that many NLSs comprise a 4 residue
pattern composed of basic amino acids (K or R), or composed of three basic amino acids (K
or R) and H or P; or a pattern starting with P and followed within 3 residues by a basic segment
containing 3 K or R residues out of 4 residues (see, e.g., Nakai et al., 1992, Genomics
14:897). Deletion of one or both of these sequences and/or additional localization sequences
is expected to interfere with hTRT transport to the nucleus and/or increase hTRT turnover,
and is useful for preventing access of telomerase to its nuclear substrates and decreasing
proliferative potential. Moreover, a variant hTRT polypeptide lacking an NLS may assemble
into an RNP that will not be able to maintain telomere length, because the resulting enzyme
cannot enter the nucleus.
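
The two NLS patterns described above can be expressed as a simple sequence scan. The following is a minimal sketch only: the function names are hypothetical, and the reading of "within 3 residues" as "at most two intervening residues between the P and the basic segment" is an assumption rather than a definition taken from this disclosure. The test strings are the two motifs quoted in the text, not the full sequence of SEQUENCE ID NO: 2, so reported positions are relative to each test string.

```python
BASIC = set("KR")


def four_residue_hits(seq):
    """Find 4-residue windows that are all K/R, or three K/R plus one H or P.
    Returns a list of (1-based start, window) tuples."""
    hits = []
    for i in range(len(seq) - 3):
        window = seq[i:i + 4]
        n_basic = sum(res in BASIC for res in window)
        n_hp = sum(res in "HP" for res in window)
        if n_basic == 4 or (n_basic == 3 and n_hp == 1):
            hits.append((i + 1, window))
    return hits


def proline_led_hits(seq, max_gap=2):
    """Find a P followed, after at most max_gap intervening residues (an
    assumed reading of "within 3 residues"), by a 4-residue segment that
    contains at least 3 K/R. Returns (1-based start of P, matched span)."""
    hits = []
    for i, res in enumerate(seq):
        if res != "P":
            continue
        for gap in range(max_gap + 1):
            start = i + 1 + gap
            segment = seq[start:start + 4]
            if len(segment) == 4 and sum(r in BASIC for r in segment) >= 3:
                hits.append((i + 1, seq[i:start + 4]))
                break  # report the nearest qualifying segment only
    return hits


print(four_residue_hits("PRRR"))     # motif quoted for residues 193-196
print(proline_led_hits("PKRPRR"))    # motif quoted for residues 235-240
```

A scan of this kind only flags candidate sites; whether a flagged segment functions as an NLS would still have to be tested, for example by the deletion approach described above.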

The hTRT polypeptides of the invention will typically be associated in the
target cell with a telomerase RNA, such as hTR, especially when they are used to increase
telomerase activity in a cell. In one embodiment, an introduced hTRT polypeptide associates
with an endogenous hTR to form a catalytically active RNP (e.g., an RNP comprising the
hTR and a full-length polypeptide having a sequence of SEQUENCE ID NO: 2). The RNP
so-formed may also associate with other, e.g., telomerase-associated, proteins. In other
embodiments, telomerase RNP (containing hTRT protein, hTR and optionally other
components) is introduced as a complex to the target cell.

In a related embodiment, an hTRT expression vector is introduced into a cell
(or progeny of a cell) into which a telomerase RNA (e.g., hTR) expression vector is
simultaneously or subsequently introduced, or has been previously introduced. In this embodiment, hTRT
protein and telomerase RNA are coexpressed in the cell and assemble to form a telomerase
RNP. A preferred telomerase RNA is hTR. An expression vector useful for expression of
hTR in a cell is described supra (see U.S. Patent 5,583,016). In yet another embodiment, the
hTRT polypeptide and hTR RNA (or equivalent) are associated in vitro to form a complex,
which is then introduced into the target cells, e.g., by liposome-mediated transfer.

In another aspect, the invention provides hTRT polypeptides useful for 
reducing telomerase activity in a cell. As above, these "inhibitory" polypeptides can be 
introduced directly, or by expression of recombinant nucleic acids in the cell. It will be 
recognized that peptide mimetics or polypeptides comprising nonstandard amino acids (i.e., 
other than the 20 amino acids encoded by the genetic code or their normal derivatives) will 
typically be introduced directly. 

In one embodiment, inhibition of telomerase activity results from the 
sequestration of a component required for accurate telomere elongation. Examples of such 
components are hTRT and hTR. Thus, administration of a polypeptide that binds hTR, but 
which does not have telomerase catalytic activity, can reduce endogenous telomerase activity 
in the cell. In a related embodiment, the hTRT polypeptide may bind a cell component other 
than hTR, such as one or more telomerase-associated proteins, thereby interfering with 
telomerase activity in the cell. 

In another embodiment, hTRT polypeptides of the invention interfere (e.g., by 
competition) with the interaction of endogenously expressed hTRT protein and another 
cellular component required for telomerase function, such as hTR, telomeric DNA, 
telomerase-associated proteins, telomere-associated proteins, telomeres, cell cycle control 
proteins, DNA repair enzymes, histone or non-histone chromosomal proteins, or others. 

In selecting molecules (e.g., polypeptides) of the invention that affect the
interaction of endogenously expressed hTRT protein and other cellular components, one may
prefer molecules that include one or more of the conserved motifs of the hTRT protein, as
described herein. The evolutionary conservation of these regions indicates that these motifs
contribute importantly to the proper functioning of human telomerase, and the
motifs are thus generally useful sites for changing hTRT protein function to create variant
hTRT proteins of the invention. Thus, variant hTRT polypeptides having mutations in
conserved motifs will be particularly useful for some applications of the invention.

In another embodiment, expression of the endogenous hTRT gene is repressed
by introduction into the cell of a large amount of hTRT polypeptide (e.g., typically at least
about 2-fold more than the endogenous level, more often at least about 10- to about 100-fold)
which acts via a feedback loop to inhibit transcription of the hTRT gene, processing of the
hTRT pre-mRNA, translation of the hTRT mRNA, or assembly and transport of the
telomerase RNP.

2) OLIGONUCLEOTIDES 

a) ANTISENSE CONSTRUCTS 

The invention provides methods and antisense oligonucleotide or 
polynucleotide reagents which can be used to reduce expression of hTRT gene products in 
vitro or in vivo. Administration of the antisense reagents of the invention to a target cell 
results in reduced telomerase activity, and is particularly useful for treatment of diseases 
characterized by high telomerase activity (e.g., cancers). Without intending to be limited to 
any particular mechanism, it is believed that antisense oligonucleotides bind to, and interfere 
with the translation of, the sense hTRT mRNA. Alternatively, the antisense molecule may
render the hTRT mRNA susceptible to nuclease digestion, interfere with transcription,
interfere with processing or localization of, or otherwise interfere with, RNA precursors ("pre-mRNA"),
repress transcription of mRNA from the hTRT gene, or act through some other mechanism.
However, the particular mechanism by which the antisense molecule reduces hTRT 
expression is not critical. 

The antisense polynucleotides of the invention comprise an antisense sequence
of at least 7 to 10, and typically 20 or more, nucleotides that specifically hybridizes to a sequence
from mRNA encoding hTRT or mRNA transcribed from the hTRT gene. More often, the
antisense polynucleotide of the invention is from about 10 to about 50 nucleotides in length 
or from about 14 to about 35 nucleotides in length. In other embodiments, antisense 
polynucleotides are polynucleotides of less than about 100 nucleotides or less than about 200 
nucleotides. In general, the antisense polynucleotide should be long enough to form a stable 
duplex but short enough, depending on the mode of delivery, to administer in vivo, if desired. 
The minimum length of a polynucleotide required for specific hybridization to a target 
sequence depends on several factors, such as G/C content, positioning of mismatched bases 
(if any), degree of uniqueness of the sequence as compared to the population of target 
polynucleotides, and chemical nature of the polynucleotide (e.g., methylphosphonate 
backbone, peptide nucleic acid, phosphorothioate), among other factors. 

Generally, to assure specific hybridization, the antisense sequence is
substantially complementary to the target hTRT mRNA sequence. In certain embodiments,
the antisense sequence is exactly complementary to the target sequence. The antisense
polynucleotides may also include, however, nucleotide substitutions, additions, deletions,
transitions, transpositions, or modifications, or other nucleic acid sequences or non-nucleic
acid moieties so long as specific binding to the relevant target sequence corresponding to
hTRT RNA or its gene is retained as a functional property of the polynucleotide.

In one embodiment, the antisense sequence is complementary to relatively
accessible sequences of the hTRT mRNA (e.g., relatively devoid of secondary structure).
This can be determined by analyzing predicted RNA secondary structures using, for example,
the MFOLD program (Genetics Computer Group, Madison WI) and testing in vitro or in vivo
as is known in the art. Examples of oligonucleotides that may be tested in cells for antisense
suppression of hTRT function are those capable of hybridizing to (i.e., substantially
complementary to) the following positions from SEQUENCE ID NO: 1: 40-60; 260-280;
500-520; 770-790; 885-905; 1000-1020; 1300-1320; 1520-1540; 2110-2130; 2295-2315;
2450-2470; 2670-2690; 3080-3110; 3140-3160; and 3690-3710. Another useful method for
identifying effective antisense compositions uses combinatorial arrays of oligonucleotides
(see, e.g., Milner et al., 1997, Nature Biotechnology 15:537).
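
Position ranges of the kind listed above can be turned into candidate antisense sequences mechanically. The following is a minimal sketch, assuming 1-based inclusive coordinates on a sense-strand DNA sequence; the helper names and the 8-nucleotide toy sequence are hypothetical and are not SEQUENCE ID NO: 1.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def parse_ranges(text):
    """Parse '40-60; 260-280' style 1-based inclusive ranges into
    a list of (start, end) tuples."""
    ranges = []
    for item in text.split(";"):
        item = item.strip()
        if item:
            start, end = item.split("-")
            ranges.append((int(start), int(end)))
    return ranges


def antisense_for(sense, start, end):
    """Return the antisense oligonucleotide, written 5'->3', that is exactly
    complementary to sense positions start..end (1-based, inclusive)."""
    if not (1 <= start <= end <= len(sense)):
        raise ValueError("range falls outside the sequence")
    return sense[start - 1:end].translate(COMPLEMENT)[::-1]


# A few of the ranges listed in the text, with the oligonucleotide length each implies.
positions = parse_ranges("40-60; 260-280; 500-520; 3080-3110")
print([(s, e, e - s + 1) for s, e in positions])

toy_sense = "ATGCCGTA"  # hypothetical toy sequence, NOT SEQUENCE ID NO: 1
print(antisense_for(toy_sense, 2, 5))
```

Because the ranges are inclusive, a range such as 40-60 implies a 21-mer, while 3080-3110 implies a 31-mer.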

The invention also provides an antisense polynucleotide that has sequences in
addition to the antisense sequence (i.e., in addition to anti-hTRT-sense sequence). In this
case, the antisense sequence is contained within a polynucleotide of longer sequence. In
another embodiment, the sequence of the polynucleotide consists essentially of, or is, the
antisense sequence.

The antisense nucleic acids (DNA, RNA, modified, analogues, and the like)
can be made using any suitable method for producing a nucleic acid, such as the chemical
synthesis and recombinant methods disclosed herein. In one embodiment, for example,
antisense RNA molecules of the invention may be prepared by de novo chemical synthesis or
by cloning. For example, an antisense RNA that hybridizes to hTRT mRNA can be made by
inserting (ligating) an hTRT DNA sequence (e.g., SEQUENCE ID No: 1, or fragment
thereof) in reverse orientation operably linked to a promoter in a vector (e.g., plasmid).
Provided that the promoter and, preferably, termination and polyadenylation signals are
properly positioned, the strand of the inserted sequence corresponding to the noncoding
strand will be transcribed and act as an antisense oligonucleotide of the invention.

The antisense oligonucleotides of the invention can be used to inhibit 
telomerase activity in cell-free extracts, cells, and animals, including mammals and humans. 
For example, the phosphorothioate antisense oligonucleotides:

A) 5'-GGCATCGCGGGGGTGGCCGGG

B) 5'-CAGCGGGGAGCGCGCGGCATC

C) 5'-CAGCACCTCGCGGTAGTGGCT

D) 5'-GGACACCTGGCGGAAGGAGGG

can be used to inhibit telomerase activity. At a concentration of 10 micromolar for each
oligonucleotide, mixtures of oligonucleotides A and B; A, B, C, and D; and A, C, and D
inhibited telomerase activity in 293 cells when treated once per day for seven days. Inhibition
was also observed when an antisense hTR molecule (5'-GCTCTAGAATGAAGGGTG-3')
was used in combination with oligonucleotides A, B, and C; A, B, and D; and A and C.

Useful control oligonucleotides in such experiments include:

S1) 5'-GCGACGACTGACATTGGCCGG

S2) 5'-GGCTCGAAGTAGCACCGGTGC

S3) 5'-GTGGGAACAGGCCGATGTCCC

To determine the optimum antisense oligonucleotide of the invention for the
particular application of interest, one can perform a scan using antisense oligonucleotide sets
of the invention. One illustrative set is the set of 30-mer oligonucleotides that span the hTRT
mRNA and are offset one from the next by fifteen nucleotides (i.e., ON1 corresponds to
positions 1-30 and is TCCCACGTGCGCAGCAGGACGCAGCGCTGC, ON2 corresponds
to positions 16-45 and is GCCGGGGCCAGGGCTTCCCACGTGCGCAGC, and ON3
corresponds to positions 31-60 and is GGCATCGCGGGGGTGGCCGGGGCCAGGGCT,
and so on to the end of the mRNA). Each member of this set can be tested for inhibitory
activity as disclosed herein. Those oligonucleotides that show inhibitory activity under the
conditions of interest then identify a region of interest, and other oligonucleotides of the
invention corresponding to the region of interest (i.e., 8-mers, 10-mers, 15-mers, and so on)
can be tested to identify the oligonucleotide with the preferred activity for the application.
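
The tiling scan described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names (`reverse_complement`, `tile_antisense`) are hypothetical, and the 60-nucleotide sense fragment is reconstructed here from the ON1, ON2, and ON3 sequences given in the text rather than copied from SEQUENCE ID NO: 1.

```python
# Sketch of the tiling scan: 30-mer antisense oligonucleotides offset by 15 nt.
# Names below are illustrative and do not come from the specification.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def tile_antisense(sense, length=30, offset=15):
    """Return (start, end, oligo) tuples; positions are 1-based and inclusive
    on the sense strand, and each oligo is the antisense of that window."""
    tiles = []
    for i in range(0, len(sense) - length + 1, offset):
        window = sense[i:i + length]
        tiles.append((i + 1, i + length, reverse_complement(window)))
    return tiles


# Antisense 30-mers quoted in the text.
ON1 = "TCCCACGTGCGCAGCAGGACGCAGCGCTGC"  # positions 1-30
ON2 = "GCCGGGGCCAGGGCTTCCCACGTGCGCAGC"  # positions 16-45
ON3 = "GGCATCGCGGGGGTGGCCGGGGCCAGGGCT"  # positions 31-60

# Assumption: sense positions 1-60 are rebuilt from the overlapping oligos.
SENSE_1_60 = reverse_complement(ON3[:15] + ON2[:15] + ON1)

if __name__ == "__main__":
    for start, end, oligo in tile_antisense(SENSE_1_60):
        print(f"{start}-{end}: 5'-{oligo}-3'")
```

Applied to a full-length mRNA sequence, the same function would return the complete scan set; shorter probes for a region of interest (e.g., 15-mers) follow by changing `length` and `offset`.
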
Exemplary antisense oligonucleotides include 5'-GGCATCGCGGGGGTGGCCGGGGCCAGGGCT-3'
(corresponding to nucleotide positions 31-60 of hTRT);
5'-GCGCAGCGTGCCAGCAGGTGAACCAGCACG-3' (corresponding to positions 496-525);
5'-GCCCGTTCGCATCCCAGACGCCTTCGGGGT-3' (corresponding to positions 631-660);
and 5'-ACGCTATGGTTCCAGGCCCGTTCGCATCCC-3' (corresponding to positions
646-675). When ACHN cells (NCI #503755) or 293 cells were treated for three days with 10
of phosphorothioate oligonucleotides with any of the four sequences supra, inhibition of
telomerase activity by about 50%-90% (compared to control untreated cells), as measured by a
TRAP assay, was observed.

For general methods relating to antisense polynucleotides, see Antisense
RNA and DNA (1988), D.A. Melton, Ed., Cold Spring Harbor Laboratory, Cold Spring
Harbor, NY. See also Dagle et al., 1991, Nucleic Acids Research 19:1805. For a review of
antisense therapy, see, e.g., Uhlmann et al., Chem. Reviews 90:543-584 (1990).

b) TRIPLEX OLIGO- AND POLYNUCLEOTIDES 

The present invention provides oligo- and polynucleotides (e.g., DNA, RNA, 
PNA or the like) that bind to double-stranded or duplex hTRT nucleic acids (e.g., in a folded 
region of the hTRT RNA or in the hTRT gene), forming a triple helix-containing, or "triplex" 
nucleic acid. Triple helix formation results in inhibition of hTRT expression by, for example, 
preventing transcription of the hTRT gene, thus reducing or eliminating telomerase activity in 
a cell. Without intending to be bound by any particular mechanism, it is believed that triple 
helix pairing compromises the ability of the double helix to open sufficiently for the binding 
of polymerases, transcription factors, or regulatory molecules to occur. 

Triplex oligo- and polynucleotides of the invention are constructed using the 
base-pairing rules of triple helix formation (see, e.g., Cheng et al., 1988, J. Biol. Chem.
263:15110; Ferrin and Camerini-Otero, 1991, Science 254:1494; Ramdas et al., 1989, J. Biol.
Chem. 264:17395; Strobel et al., 1991, Science 254:1639; and Rigas et al., 1986, Proc. Natl.
Acad. Sci. U.S.A. 83:9591; each of which is incorporated herein by reference) and the hTRT
mRNA and/or gene sequence. Typically, the triplex-forming oligonucleotides of the 
invention comprise a specific sequence of from about 10 to at least about 25 nucleotides or
longer "complementary" to a specific sequence in the hTRT RNA or gene (i.e., large enough
to form a stable triple helix, but small enough, depending on the mode of delivery, to
administer in vivo, if desired). In this context, "complementary" means able to form a stable 
triple helix. In one embodiment, oligonucleotides are designed to bind specifically to the 
regulatory regions of the hTRT gene (e.g., the hTRT 5'-flanking sequence, promoters, and 
enhancers) or to the transcription initiation site, (e.g., between -10 and +10 from the 
transcription initiation site). For a review of recent therapeutic advances using triplex DNA, 
see Gee et al., in Huber and Carr, 1994, Molecular and Immunologic Approaches, Futura 
Publishing Co., Mt. Kisco, NY, and Rininsland et al., 1997, Proc. Natl. Acad. Sci. USA
94:5854, which are both incorporated herein by reference. 

c) RIBOZYMES 

The present invention also provides ribozymes useful for inhibition of 
telomerase activity. The ribozymes of the invention bind and specifically cleave and 
inactivate hTRT mRNA. Useful ribozymes can comprise 5'- and 3'-terminal sequences
complementary to the hTRT mRNA and can be engineered by one of skill on the basis of the 
hTRT mRNA sequence disclosed herein (see PCT publication WO 93/23572, supra). 
Ribozymes of the invention include those having characteristics of group I intron ribozymes 
(Cech, 1995, Biotechnology 13:323) and others of hammerhead ribozymes (Edgington, 1992, 
Biotechnology 10:256). 

Ribozymes of the invention include those having cleavage sites such as GUA, 
GUU and GUC. Other optimum cleavage sites for ribozyme-mediated inhibition of 
telomerase activity in accordance with the present invention include those described in PCT 
publications WO 94/02595 and WO 93/23569, both incorporated herein by reference. Short 
RNA oligonucleotides between 15 and 20 ribonucleotides in length corresponding to the 
region of the target hTRT gene containing the cleavage site can be evaluated for secondary 
structural features that may render the oligonucleotide more desirable. The suitability of 
cleavage sites may also be evaluated by testing accessibility to hybridization with 
complementary oligonucleotides using ribonuclease protection assays, or by testing for in 
vitro ribozyme activity in accordance with standard procedures known in the art. 
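
As a rough illustration of the first screening step described above, the following Python sketch lists every GUA, GUU, and GUC triplet in an RNA sequence together with a short stretch of surrounding target sequence that could then be examined for secondary structure. The function name, the return format, and the default 20-nucleotide window are assumptions made for this sketch; it does not predict structure or accessibility.

```python
# Sketch: locate candidate ribozyme cleavage triplets in a target RNA.
# The triplets are those named in the text; everything else is illustrative.
CLEAVAGE_TRIPLETS = ("GUA", "GUU", "GUC")


def find_cleavage_sites(rna, window=20):
    """Return (position, triplet, context) for each candidate site.

    position is the 1-based index of the G of the triplet; context is a
    stretch of up to `window` nucleotides roughly centered on the triplet
    and clipped at the ends of the sequence.
    """
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    hits = []
    for i in range(len(rna) - 2):
        triplet = rna[i:i + 3]
        if triplet in CLEAVAGE_TRIPLETS:
            start = max(0, i + 1 - window // 2)
            end = min(len(rna), start + window)
            start = max(0, end - window)
            hits.append((i + 1, triplet, rna[start:end]))
    return hits


if __name__ == "__main__":
    for pos, triplet, context in find_cleavage_sites("AAAGUCAAAA", window=6):
        print(pos, triplet, context)
```

Candidate sites returned this way would still need the experimental checks named in the text (ribonuclease protection assays or in vitro ribozyme activity) before being selected.
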

As described by Hu et al., PCT publication WO 94/03596, incorporated herein 
by reference, antisense and ribozyme functions can be combined in a single oligonucleotide.
Moreover, ribozymes can comprise one or more modified nucleotides or modified linkages
between nucleotides, as described above in conjunction with the description of illustrative 
antisense oligonucleotides of the invention. 

In one embodiment, the ribozymes of the invention are generated in vitro and 
introduced into a cell or patient. In another embodiment, gene therapy methods are used for 
expression of ribozymes in a target cell ex vivo or in vivo. 

d) ADMINISTRATION OF OLIGONUCLEOTIDES 

Typically, the therapeutic methods of the invention involve the administration 
of an oligonucleotide that functions to inhibit or stimulate telomerase activity under in vivo 
physiological conditions, and is relatively stable under those conditions for a period of time 
sufficient for a therapeutic effect. As noted above, modified nucleic acids may be useful in 
imparting such stability, as well as for targeting delivery of the oligonucleotide to the desired 
tissue, organ, or cell. 

Oligo- and poly-nucleotides can be delivered directly as a drug in a suitable 
pharmaceutical formulation, or indirectly by means of introducing a nucleic acid into a cell, 
including liposomes, immunoliposomes, ballistics, direct uptake into cells, and the like as 
described herein. For treatment of disease, the oligonucleotides of the invention will be 
administered to a patient in a therapeutically effective amount. A therapeutically effective 
amount is an amount sufficient to ameliorate the symptoms of the disease or modulate 
telomerase activity in the target cell, e.g., as can be measured using a TRAP assay or other 
suitable assay of telomerase biological function. Methods useful for delivery of 
oligonucleotides for therapeutic purposes are described in U.S. Patent 5,272,065, 
incorporated herein by reference. Other details of administration of pharmaceutically active 
compounds are provided below. In another embodiment, oligo- and poly-nucleotides can be 
delivered using gene therapy and recombinant DNA expression plasmids of the invention. 

3) GENE THERAPY 

Gene therapy refers to the introduction of an otherwise exogenous 
polynucleotide which produces a medically useful phenotypic effect upon the (typically) 
mammalian cell(s) into which it is transferred. In one aspect, the present invention provides
gene therapy methods and compositions for treatment of telomerase-associated conditions. In
illustrative embodiments, gene therapy involves introducing into a cell a vector that expresses
an hTRT gene product (such as an hTRT protein substantially similar to the hTRT
polypeptide having a sequence of SEQUENCE ID NO: 2, e.g., to increase telomerase
activity, or an inhibitory hTRT polypeptide to reduce activity), expresses a nucleic acid
having an hTRT gene or mRNA sequence (such as an antisense RNA, e.g., to reduce
telomerase activity), expresses a polypeptide or polynucleotide that otherwise affects
expression of hTRT gene products (e.g., a ribozyme directed to hTRT mRNA to reduce
telomerase activity), or replaces or disrupts an endogenous hTRT sequence (e.g., gene
replacement and "gene knockout," respectively). Numerous other embodiments will be
evident to one of skill upon review of the disclosure herein. In one embodiment, a vector
encoding hTR is also introduced. In another embodiment, vectors encoding
telomerase-associated proteins are also introduced with or without a vector for hTR.

Vectors useful in hTRT gene therapy can be viral or nonviral, and include
those described supra in relation to the hTRT expression systems of the invention. It will be
understood by those of skill in the art that gene therapy vectors may comprise promoters and
other regulatory or processing sequences, such as are described in this disclosure. Usually the
vector will comprise a promoter and, optionally, an enhancer (separate from any contained
within the promoter sequences) that serve to drive transcription of an oligoribonucleotide, as
well as other regulatory elements that provide for episomal maintenance or chromosomal
integration and for high-level transcription, if desired. A plasmid useful for gene therapy can
comprise other functional elements, such as selectable markers, identification regions, and
other sequences. The additional sequences can have roles in conferring stability both outside
and within a cell, targeting delivery of hTRT nucleotide sequences (sense or antisense) to a
specified organ, tissue, or cell population, mediating entry into a cell, mediating entry into the
nucleus of a cell and/or mediating integration within nuclear DNA. For example,
aptamer-like DNA structures, or other protein-binding moieties or sites, can be used to mediate
binding of a vector to cell surface receptors or to serum proteins that bind to a receptor,
thereby increasing the efficiency of DNA transfer into the cell. Other DNA sites and
structures can directly or indirectly bind to receptors in the nuclear membrane or to other
proteins that go into the nucleus, thereby facilitating nuclear uptake of a vector. Other DNA
sequences can directly or indirectly affect the efficiency of integration.

Suitable gene therapy vectors may, or may not, have an origin of replication. 
For example, it is useful to include an origin of replication in a vector for propagation of the 
vector prior to administration to a patient. However, the origin of replication can often be 
removed before administration if the vector is designed to integrate into host chromosomal
DNA or bind to host mRNA or DNA. In some situations (e.g., tumor cells) it may not be 
necessary for the exogenous DNA to integrate stably into the transduced cell, because 
transient expression may suffice to kill the tumor cells. 

As noted, the present invention also provides methods and reagents for gene
replacement therapy (i.e., replacement by homologous recombination of an endogenous
hTRT gene with a recombinant gene). Vectors specifically designed for integration by
homologous recombination may be used. Important factors for optimizing homologous
recombination include the degree of sequence identity and length of homology to
chromosomal sequences. The specific sequence mediating homologous recombination is also
important, because integration occurs much more easily in transcriptionally active DNA.
Methods and materials for constructing homologous targeting constructs are described by,
e.g., Mansour et al., 1988, Nature 336:348; Bradley et al., 1992, Bio/Technology 10:534.
See also, U.S. Patent Nos. 5,627,059; 5,487,992; 5,631,153; and 5,464,764. In one
embodiment, gene replacement therapy involves altering or replacing all or a portion of the
regulatory sequences controlling expression of the hTRT gene that is to be regulated. For
example, the hTRT promoter sequences (e.g., such as are found in SEQUENCE ID NO: 6)
may be disrupted (to decrease hTRT expression or to abolish a transcriptional control site) or
an exogenous promoter (e.g., to increase hTRT expression) substituted.

The invention also provides methods and reagents for hTRT "gene knockout"
(i.e., deletion or disruption by homologous recombination of an endogenous hTRT gene using
a recombinantly produced vector). In gene knockout, the targeted sequences can be
regulatory sequences (e.g., the hTRT promoter), or RNA or protein coding sequences. The
use of homologous recombination to alter expression of endogenous genes is described in
detail in U.S. Patent No. 5,272,071 (and the U.S. Patents cited supra), WO 91/09955, WO
93/09222, WO 96/29411, WO 95/31560, and WO 91/12650. See also, Moynahan et al.,
1996, Hum. Mol. Genet. 5:875.

The invention further provides methods for specifically killing
telomerase-positive cells, or preventing transformation of telomerase-negative cells to a
telomerase-positive state, using the hTRT gene promoter to regulate expression of a protein toxic to the
cell. As shown in Example 14, an hTRT promoter sequence may be operably linked to a
reporter gene such that activation of the promoter results in expression of the protein encoded
by the reporter gene. If, instead of a reporter protein, the encoded protein is toxic to the cell,
activation of the promoter leads to cell morbidity or death. In one embodiment of the present
invention, a vector comprising an hTRT promoter operably linked to a gene encoding a toxic
protein is introduced into cells, such as human cells, e.g., cells in a human patient, resulting in
cell death of cells in which hTRT promoter activating factors are expressed, such as cancer
cells. In a related embodiment, the encoded protein is not itself toxic to a cell, but encodes an
activity that renders the cell sensitive to an otherwise nontoxic drug. For example, tumors
can be treated by introducing an hTRT-promoter-Herpes thymidine kinase (TK) gene fusion
construct into tumor cells, and administering ganciclovir or the equivalent (see, e.g., Moolten
and Wells, 1990, J. Nat'l Canc. Inst. 82:297). The art knows of numerous other suitable toxic
or potentially toxic proteins and systems (using promoter sequences other than hTRT) that
may be modified and applied in accordance with the present invention by one of skill in the
art upon review of this disclosure.

Gene therapy vectors may be introduced into cells or tissues in vivo, in vitro or 
ex vivo. For ex vivo therapy, vectors may be introduced into cells, e.g., stem cells, taken from 
the patient and clonally propagated for autologous transplant back into the same patient (see, 
e.g., U.S. Patent Nos. 5,399,493 and 5,437,994, the disclosures of which are herein 
incorporated by reference). Cells that can be targeted for hTRT gene therapy aimed at 
increasing the telomerase activity of a target cell include, but are not limited to, embryonic 
stem or germ cells, particularly primate or human cells, as noted supra, hematopoietic stem 
cells (AIDS and post-chemotherapy), vascular endothelial cells (cardiac and cerebral vascular 
disease), skin fibroblasts and basal skin keratinocytes (wound healing and burns), 
chondrocytes (arthritis), brain astrocytes and microglial cells (Alzheimer's Disease), 
osteoblasts (osteoporosis), retinal cells (eye diseases), and pancreatic islet cells (Type I 
diabetes) and any of the cells listed in Table 3, infra, as well as any other cell types known to 
divide.

In one embodiment of the invention, an inducible promoter operably linked to
a TRT, such as hTRT, coding sequence (or variant) is used to modulate the proliferative 
capacity of cells in vivo or in vitro. In a particular embodiment, for example, insulin- 
producing pancreatic cells transfected with an hTRT expression vector under the control of an 
inducible promoter are introduced into a patient. The proliferative capacity of the cells can 
then be controlled by administration to the patient of the promoter activating agent (e.g., 
tetracycline) to enable the cells to multiply more than otherwise would have been possible. 
Cell proliferation can then be terminated, continued, or reinitiated as desired by the treating 
physician. 

4) VACCINES AND ANTIBODIES 

Immunogenic peptides or polypeptides having an hTRT sequence can be used
to elicit an anti-hTRT immune response in a patient (i.e., act as a vaccine). Exemplary 
immunogenic hTRT peptides and polypeptides are described infra in Examples 6 and 8. An 
immune response can also be raised by delivery of plasmid vectors encoding the polypeptide 
of interest (i.e., administration of "naked DNA"). The nucleic acids of interest can be 
delivered by injection, liposomes, or other means of administration. In one embodiment, 
immunization modes that elicit in the subject a Class I MHC restricted cytotoxic lymphocyte 
response against telomerase expressing cells are chosen. Once immunized, the individual or 
animal will elicit a heightened immune response against cells expressing high levels of 
telomerase (e.g., malignant cells). 

Anti-hTRT antibodies (e.g., murine, human, or humanized monoclonal
antibodies) may also be administered to a patient (e.g., passive immunization) to effect an
immune response against telomerase-expressing cells.

F) PHARMACEUTICAL COMPOSITIONS 

In related aspects, the invention provides pharmaceutical compositions that 
comprise hTRT oligo- and poly-nucleotides, polypeptides, and antibodies, agonists, 
antagonists, or inhibitors, alone or in combination with at least one other agent, such as a 
stabilizing compound, diluent, carrier, or another active ingredient or agent. 

The therapeutic agents of the invention may be administered in any sterile,
biocompatible pharmaceutical carrier, including, but not limited to, saline, buffered saline,
dextrose, and water. Any of these molecules can be administered to a patient alone, or in
combination with other agents, drugs or hormones, in pharmaceutical compositions where it
is mixed with suitable excipient(s), adjuvants, and/or pharmaceutically acceptable carriers. In
one embodiment of the present invention, the pharmaceutically acceptable carrier is
pharmaceutically inert.

Administration of pharmaceutical compositions is accomplished orally or 
parenterally. Methods of parenteral delivery include topical, intra-arterial (e.g., directly to the 
tumor), intramuscular, subcutaneous, intramedullary, intrathecal, intraventricular, 
intravenous, intraperitoneal, or intranasal administration. In addition to the active 
ingredients, these pharmaceutical compositions may contain suitable pharmaceutically 
acceptable carriers comprising excipients and other compounds that facilitate processing of 
the active compounds into preparations which can be used pharmaceutically. Further details 
on techniques for formulation and administration may be found in the latest edition of
"Remington's Pharmaceutical Sciences" (Mack Publishing Co., Easton, PA).

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, etc., suitable for 
ingestion by the patient. See PCT publication WO 93/23572. 

Pharmaceutical preparations for oral use can be obtained through combination 
of active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable additional compounds, if desired, to 
obtain tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers and include,
but are not limited to, sugars, including lactose, sucrose, mannitol, or sorbitol; starch from
corn, wheat, rice, potato, or other plants; cellulose such as methyl cellulose, 
hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; and gums including 
arabic and tragacanth; as well as proteins such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl 
pyrrolidone, agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores are provided with suitable coatings such as concentrated sugar
solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel,
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents
or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for
product identification or to characterize the quantity of active compound (i.e., dosage).

Pharmaceutical preparations which can be used orally include push-fit
capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a coating such
as glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with a filler or
binders such as lactose or starches, lubricants such as talc or magnesium stearate, and,
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended
in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycol with or
without stabilizers.

Pharmaceutical formulations for parenteral administration include aqueous
solutions of active compounds. For injection, the pharmaceutical compositions of the
invention may be formulated in aqueous solutions, preferably in physiologically compatible
buffers such as Hank's solution, Ringer's solution, or physiologically buffered saline.
Aqueous injection suspensions may contain substances which increase the viscosity of the
suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally,
suspensions of the active compounds may be prepared as appropriate oily injection
suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or
synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Optionally, the
suspension may also contain suitable stabilizers or agents which increase the solubility of the
compounds to allow for the preparation of highly concentrated solutions.

For topical or nasal administration, penetrants appropriate to the particular
barrier to be permeated are used in the formulation. Such penetrants are generally known in
the art.

The pharmaceutical compositions of the present invention may be 
manufactured in a manner similar to that known in the art (e.g., by means of conventional 
mixing, dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, 
entrapping or lyophilizing processes).

The pharmaceutical composition may be provided as a salt and can be formed
with many acids, including but not limited to hydrochloric, sulfuric, acetic, lactic, tartaric,
malic, succinic, etc. Salts tend to be more soluble in aqueous or other protonic solvents than
are the corresponding free base forms. In other cases, the preferred preparation may be a
lyophilized powder in 1 mM-50 mM histidine, 0.1%-2% sucrose, 2%-7% mannitol at a pH
range of 4.5 to 5.5, that is combined with buffer prior to use.

After pharmaceutical compositions comprising a compound of the invention 
formulated in an acceptable carrier have been prepared, they can be placed in an appropriate
container and labeled for treatment of an indicated condition. For administration of human 
telomerase proteins and nucleic acids, such labeling would include amount, frequency and 
method of administration. 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve 
the intended purpose. "Therapeutically effective amount" or "pharmacologically effective 
amount" are well recognized phrases and refer to that amount of an agent effective to produce 
the intended pharmacological result. Thus, a therapeutically effective amount is an amount 
sufficient to ameliorate the symptoms of the disease being treated. One useful assay in 
ascertaining an effective amount for a given application (e.g., a therapeutically effective 
amount) is measuring the effect on telomerase activity in a target cell. The amount actually 
administered will be dependent upon the individual to which treatment is to be applied, and 
will preferably be an optimized amount such that the desired effect is achieved without 
significant side-effects. The determination of a therapeutically effective dose is well within 
the capability of those skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays or in any appropriate animal model. The animal model is also 
used to achieve a desirable concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in 
humans. 

A therapeutically effective amount refers to that amount of protein, 
polypeptide, peptide, antibody, oligo- or polynucleotide, agonist or antagonists which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity of such compounds 
can be determined by standard pharmaceutical procedures in cell cultures or experimental 
animals (e.g., ED50, the dose therapeutically effective in 50% of the population; and LD50, the
dose lethal to 50% of the population). The dose ratio between toxic and therapeutic effects is
the therapeutic index, and it can be expressed as the ratio LD50/ED50. Pharmaceutical
compositions which exhibit large therapeutic indices are preferred. The data obtained from
cell culture assays and animal studies are used in formulating a range of dosage for human use.
The dosage of such compounds lies preferably within a range of circulating concentrations
that include the ED50 with little or no toxicity. The dosage varies within this range depending
upon the dosage form employed, sensitivity of the patient, and the route of administration. 
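
The therapeutic index arithmetic can be illustrated with a short Python sketch. It assumes the conventional definition of the index as LD50 divided by ED50, under which a larger value indicates a wider margin between the effective and lethal doses (consistent with the stated preference for large indices). The function name and the example doses are hypothetical and are not data from this disclosure.

```python
# Sketch: therapeutic index under the conventional LD50/ED50 definition.
# The function name and the numbers used below are illustrative only.
def therapeutic_index(ld50, ed50):
    """Return LD50/ED50; both doses must be positive and in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


if __name__ == "__main__":
    # Hypothetical compound: effective at 25 mg/kg, lethal at 500 mg/kg.
    print(therapeutic_index(ld50=500.0, ed50=25.0))
```
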

The exact dosage is chosen by the individual physician in view of the patient 
to be treated. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired effect. Additional factors which may be taken into account 
include the severity of the disease state (e.g., tumor size and location; age, weight and gender 
of the patient; diet, time and frequency of administration, drug combination(s), reaction 
sensitivities, and tolerance/response to therapy). Long acting pharmaceutical compositions 
might be administered every 3 to 4 days, every week, or once every two weeks depending on 
half-life and clearance rate of the particular formulation. Guidance as to particular dosages 
and methods of delivery is provided in the literature (see, US Patent Nos. 4,657,760; 
5,206,344; and 5,225,212, herein incorporated by reference). Those skilled in the art will 
typically employ different formulations for nucleotides than for proteins or their inhibitors. 
Similarly, delivery of polynucleotides or polypeptides can be specific to particular cells, 
conditions, locations, and the like. 

VIII. INCREASING PROLIFERATIVE CAPACITY AND PRODUCTION OF 
IMMORTALIZED CELLS, CELL LINES, AND ANIMALS 

As discussed above, most vertebrate cells senesce after a finite number of 
divisions in culture (e.g., 50 to 100 divisions). Certain variant cells, however, are able to 
divide indefinitely in culture (e.g., HeLa cells, 293 cells) and, for this reason, are useful for 
research and industrial applications. Usually these immortal cell lines are derived from 
spontaneously arising tumors, or by transformation by exposure to radiation or a tumor- 
inducing virus or chemical. Unfortunately, a limited selection of cell lines, especially human 
cell lines representing differentiated cell function, is available. Moreover, the immortal cell 
lines presently available are characterized by chromosomal abnormalities (e.g., aneuploidy,
gene rearrangements, or mutations). Further, many long-established cell lines are relatively
undifferentiated (e.g., they do not produce highly specialized products of the sort that 
uniquely characterize particular tissues or organs). Thus, there is a need for new methods of 
generating immortal cells, especially human cells. As used herein, the term "immortalized 
cells" is not limited to cells that proliferate indefinitely, but may also include cells with 
increased proliferative capacity compared to similar wild-type cells. Depending on the cell 
type, increased proliferative capacity may mean proliferation for at least about 100, about 
150, about 200, or about 400 or more generations, or for at least about 6, about 12, about 18, 
about 24 or about 36 or more months in in vitro culture. One use for immortalized cells is in 
production of natural proteins and recombinant proteins (e.g., therapeutic polypeptides such 
as erythropoietin, human growth hormone, insulin, and the like), or antibodies, for which a 
stable, genetically normal cell line is preferred. For production of some recombinant 
proteins, specialized cell types may also be preferred (e.g., pancreatic cells for the production 
of human insulin). Another use for immortalized cells or even mortal cells with increased 
proliferative capacity (relative to unmodified cells) is for introduction into a patient for gene 
therapy, or for replacement of diseased or damaged cells or tissue. For example, autologous 
immune cells containing or expressing a, e.g., recombinant hTRT gene or polypeptide of the 
invention can be used for cell replacement in a patient after aggressive cancer therapy, e.g., 
whole body irradiation. Another use for immortalized cells is for ex vivo production of 
"artificial" tissues or organs (e.g., skin) for therapeutic use. Another use for such cells is for 
screening or validation of drugs, such as telomerase-inhibiting drugs, or for use in production 
of vaccines or biological reagents. Additional uses of the cells of the invention will be 
apparent to those of skill. 

The immortalized cells and cell lines, as well as those of merely increased 
replicative capacity, of the invention are made by increasing telomerase activity in the cell. 
Any method disclosed herein for increasing telomerase activity can be used. Thus, in one 
embodiment, cells are immortalized by increasing the amount of an hTRT polypeptide in the 
cell. In one embodiment, hTRT levels are increased by introducing an hTRT expression 
vector into the cell (with stable transfection sometimes preferred). As discussed above, the 
hTRT coding sequence is usually operably linked to a promoter, which may be inducible or 
constitutively active in the cell. 

In one embodiment, a polynucleotide comprising a sequence encoding a
polypeptide of SEQUENCE ID NO: 2, which sequence is operably linked to a promoter (e.g., 
a constitutively expressed promoter, e.g., a sequence of SEQUENCE ID NO: 6), is introduced 
into the cell. In one embodiment the polynucleotide comprises a sequence of SEQUENCE 
ID NO: 1. Preferably the polynucleotide includes polyadenylation and termination signals.
In other embodiments, additional elements such as enhancers or others discussed supra are 
included. In an alternative embodiment, the polynucleotide does not include a promoter 
sequence, such sequence being provided by the target cell endogenous genome following 
integration (e.g., recombination, e.g., homologous recombination) of the introduced 
polynucleotide. The polynucleotide may be introduced into the target cell by any method, 
including any method disclosed herein, such as lipofection, electroporation, virosomes,
liposomes, immunoliposomes, polycation:nucleic acid conjugates, or naked DNA.

Using the methods of the invention, any vertebrate cell can be caused to have 
an increased proliferative capacity or even be immortalized and sustained indefinitely in 
culture. In one embodiment the cells are mammalian, with human cells preferred for many 
applications. Examples of human cells that can be immortalized include those listed in Table 
3. 

It will be recognized that the "diagnostic" assays of the invention described 
infra may be used to identify and characterize the immortalized cells of the invention. 

TABLE 3 

HUMAN CELLS IN WHICH hTRT EXPRESSION MAY BE INCREASED
Keratinizing Epithelial Cells 

keratinocyte of epidermis (differentiating epidermal cell) 

basal cell of epidermis (stem cell) 

keratinocyte of fingernails and toenails 

basal cell of nail bed (stem cell) 

hair shaft cells 

medullary, cortical, cuticular; hair-root sheath cells, 
cuticular, of Huxley's layer, of Henle's layer external; 
hair matrix cell (stem cell) 

Cells of Wet Stratified Barrier Epithelia 

surface epithelial cell of stratified squamous epithelium of 
tongue, oral cavity, esophagus, anal canal, distal urethra, 
vagina

basal cell of these epithelia (stem cell) 
cell of external corneal epithelium 

cell of urinary epithelium (lining bladder and urinary ducts) 


Epithelial Cells Specialized for Exocrine Secretion 

cells of salivary gland 

mucous cell (secretion rich in polysaccharide) 
serous cell (secretion rich in glycoprotein enzymes)
cell of von Ebner's gland in tongue (secretion to wash over
taste buds)

cell of mammary gland, secreting milk 

cell of lacrimal gland, secreting tears 

cell of ceruminous gland of ear, secreting wax 
cell of eccrine sweat gland, secreting glycoproteins (dark
cell)

cell of eccrine sweat gland, secreting small molecules (clear 
cell) 

cell of apocrine sweat gland (odoriferous secretion,
sex-hormone sensitive)

cell of gland of Moll in eyelid (specialized sweat gland) 
cell of sebaceous gland, secreting lipid-rich sebum 
cell of Bowman's gland in nose (secretion to wash over 
olfactory epithelium) 

cell of Brunner's gland in duodenum, secreting alkaline
solution of mucus and enzymes 

cell of seminal vesicle, secreting components of seminal 
fluid, including fructose (as fuel for swimming sperm) 
cell of prostate gland, secreting other components of seminal 
fluid

cell of bulbourethral gland, secreting mucus 

cell of Bartholin's gland, secreting vaginal lubricant 

cell of gland of Littre, secreting mucus 

cell of endometrium of uterus, secreting mainly carbohydrates 
isolated goblet cell of respiratory and digestive tracts,
secreting mucus 

mucous cell of lining of stomach 

zymogenic cell of gastric gland, secreting pepsinogen 
oxyntic cell of gastric gland, secreting HCl
acinar cell of pancreas, secreting digestive enzymes and
bicarbonate 

Paneth cell of small intestine, secreting lysozyme 
type II pneumocyte of lung, secreting surfactant 
Clara cell of lung 


Cells specialized for Secretion of Hormones 
cells of anterior pituitary, secreting

growth hormone, follicle-stimulating hormone,
luteinizing hormone, prolactin, adrenocorticotropic
hormone, and thyroid-stimulating hormone
cell of intermediate pituitary, secreting
melanocyte-stimulating hormone 
cells of posterior pituitary, secreting 

oxytocin, vasopressin 
cells of gut, secreting 
serotonin, endorphin, somatostatin, gastrin,

secretin, cholecystokinin, insulin and glucagon 
cells of thyroid gland, secreting 

thyroid hormone, calcitonin 
cells of parathyroid gland, secreting 
parathyroid hormone, oxyphil cell

cells of adrenal gland, secreting 

epinephrine, norepinephrine, and steroid hormones; 
mineralocorticoids 
glucocorticoids 
cells of gonads, secreting

testosterone (Leydig cell of testis) 

estrogen (theca interna cell of ovarian follicle) 

progesterone (corpus luteum cell of ruptured ovarian 

follicle) 

cells of juxtaglomerular apparatus of kidney
juxtaglomerular cell (secreting renin) 
macula densa cell 
peripolar cell 
mesangial cell 


Epithelial Absorptive Cells in Gut, Exocrine Glands, and 
Urogenital Tract 

brush border cell of intestine (with microvilli) 

striated duct cell of exocrine glands 
gall bladder epithelial cell

brush border cell of proximal tubule of kidney 

distal tubule cell of kidney 

nonciliated cell of ductulus efferens 

epididymal principal cell 
epididymal basal cell

Cells Specialized for Metabolism and Storage 

hepatocyte (liver cell) 
fat cells 
white fat

brown fat 
lipocyte of liver

Epithelial Cells Serving Primarily a Barrier Function, Lining 
the Lung, Gut, Exocrine Glands, and Urogenital Tract 

type I pneumocyte (lining air space of lung)
pancreatic duct cell (centroacinar cell) 

nonstriated duct cell of sweat gland, salivary gland, mammary 
gland 

parietal cell of kidney glomerulus 
podocyte of kidney glomerulus

cell of thin segment of loop of Henle (in kidney) 

collecting duct cell (in kidney) 

duct cell of seminal vesicle, prostate gland 

Epithelial Cells Lining Closed Internal Body Cavities

vascular endothelial cells of blood vessels and lymphatics 
fenestrated 
continuous 
splenic 

synovial cell (lining joint cavities, secreting largely
hyaluronic acid) 
serosal cell (lining peritoneal, pleural, and pericardial 
cavities) 

squamous cell lining perilymphatic space of ear 
cells lining endolymphatic space of ear
squamous cell 

columnar cells of endolymphatic sac 
with microvilli 
without microvilli 
"dark" cell

vestibular membrane cell (resembling choroid plexus cell) 
stria vascularis basal cell 
stria vascularis marginal cell 
cell of Claudius 
cell of Boettcher

choroid plexus cell (secreting cerebrospinal fluid) 
squamous cell of pia-arachnoid 
cells of ciliary epithelium of eye 
pigmented 
nonpigmented

corneal "endothelial" cell 

Ciliated Cells with Propulsive Function 

of respiratory tract 
of oviduct and of endometrium of uterus (in female)
of rete testis and ductulus efferens (in male) 
of central nervous system (ependymal cell lining brain
cavities) 

Cells Specialized for Secretion of Extracellular Matrix 

epithelial:

ameloblast (secreting enamel of tooth) 

planum semilunatum cell of vestibular apparatus of ear 

(secreting proteoglycan) 
interdental cell of organ of Corti (secreting tectorial 
"membrane" covering hair cells of organ of Corti)

nonepithelial (connective tissue) 

fibroblasts (various: of loose connective tissue, of cornea,
of tendon, of reticular tissue of bone marrow, etc.) 
pericyte of blood capillary 
nucleus pulposus cell of intervertebral disc

cementoblast/cementocyte (secreting bonelike cementum of 

root of tooth) 

odontoblast/odontocyte (secreting dentin of tooth) 
chondrocytes 

of hyaline cartilage, of fibrocartilage, of elastic
cartilage
osteoblast/osteocyte

osteoprogenitor cell (stem cell of osteoblasts) 
hyalocyte of vitreous body of eye 
stellate cell of perilymphatic space of ear

Contractile Cells 

skeletal muscle cells 
red (slow) 
white (fast)

intermediate 

muscle spindle — nuclear bag 
muscle spindle — nuclear chain 
satellite cell (stem cell) 
heart muscle cells
ordinary 
nodal 

Purkinje fiber
smooth muscle cells 
myoepithelial cells
of iris 

of exocrine glands 

Cells of Blood and Immune System 

red blood cell
megakaryocyte 
macrophages
monocyte 

connective tissue macrophage (various) 

Langerhans cell (in epidermis) 
osteoclast (in bone)

dendritic cell (in lymphoid tissues) 

microglial cell (in central nervous system) 
neutrophil 
eosinophil 
basophil
mast cell 
T lymphocyte 

helper T cell 

suppressor T cell 
killer T cell

B lymphocyte 

IgM 

IgG 

IgA 

IgE

killer cell 

stem cells for the blood and immune system (various) 

Sensory Transducers 

photoreceptors
rod 
cones 

blue sensitive 
green sensitive 
red sensitive

hearing 

inner hair cell of organ of Corti 
outer hair cell of organ of Corti 
acceleration and gravity 
type I hair cell of vestibular apparatus of ear

type II hair cell of vestibular apparatus of ear 
taste 

type II taste bud cell
smell 

olfactory neuron

basal cell of olfactory epithelium (stem cell for olfactory 
neurons)
blood pH

carotid body cell 
type I

type II 
touch

Merkel cell of epidermis 

primary sensory neurons specialized for touch
temperature
primary sensory neurons specialized for temperature 
cold sensitive

heat sensitive 

pain 

primary sensory neurons specialized for pain
configurations and forces in musculoskeletal system
proprioceptive primary sensory neurons

Autonomic Neurons 

cholinergic 
adrenergic 
peptidergic

Supporting Cells of Sense Organs and of Peripheral Neurons 

supporting cells of organ of Corti 
inner pillar cell 
outer pillar cell

inner phalangeal cell 
outer phalangeal cell 
border cell 
Hensen cell 

supporting cell of vestibular apparatus

supporting cell of taste bud (type I taste bud cell) 
supporting cell of olfactory epithelium 
Schwann cell 

satellite cell (encapsulating peripheral nerve cell bodies) 
enteric glial cell

Neurons and Glial Cells of Central Nervous System 

neurons 
glial cells 
astrocyte
oligodendrocyte

Lens Cells 

anterior lens epithelial cell 
lens fiber (crystallin-containing cell)

Pigment Cells

melanocyte, retinal pigmented epithelial cell

Germ Cells 

oogonium/oocyte
spermatocyte 

spermatogonium (stem cell for spermatocyte) 

Nurse Cells 

ovarian follicle cell 
Sertoli cell (in testis) 
thymus epithelial cell 

Stem Cells 

embryonic stem cell 
embryonic germ cell 

adult stem cell 

fetal stem cell 

IX. DIAGNOSTIC ASSAYS 
A) INTRODUCTION 
1) TRT ASSAYS 

The present invention provides a wide variety of assays for TRT, preferably 
hTRT, and telomerase. These assays provide, inter alia, the basis for sensitive, inexpensive, 
convenient, and widely applicable assays for diagnosis and prognosis of a number of human 
diseases, of which cancer is an illustrative example. As noted supra, hTRT gene products 
(protein and mRNA) are usually elevated in immortal human cells relative to most normal 
mortal cells (i.e., telomerase-negative cells and most telomerase-positive normal adult 
somatic cells). Thus, in one aspect, the invention provides assays useful for detecting or 
measuring the presence, absence, or quantity of an hTRT gene product in a sample from, or 
containing, human or other mammalian or eukaryotic cells to characterize the cells as
immortal (such as a malignant tumor cell) or mortal (such as most normal somatic cells in 
adults) or as telomerase positive or negative. 

Any condition characterized by the presence or absence of an hTRT gene 
product (i.e., protein or RNA) may be diagnosed using the methods and materials described 
herein. These include, as described more fully below, cancers, other diseases of accelerated 
cell proliferation, immunological disorders, fertility, infertility, and others. Moreover,
because the degree to which telomerase activity is elevated in cancer cells is correlated with 
characteristics of the tumor, such as metastatic potential, monitoring hTRT mRNA or protein
levels can be used to estimate and predict the likely future progression of a tumor. 

In one aspect, the diagnostic and prognostic methods of the invention entail 
determining whether a human TRT gene product is present in a biological sample (e.g., from 
a patient). In a second aspect, the abundance of hTRT gene product in a biological sample 
(e.g., from a patient) is determined and compared to the abundance in a control sample (e.g., 
normal cells or tissues). In a third aspect, the cellular or intracellular localization of an hTRT 
gene product is determined in a cell or tissue sample. In a fourth aspect, host (e.g., patient) 
cells are assayed to identify nucleic acids with sequences characteristic of a heritable 
propensity for abnormal hTRT gene expression (abnormal quantity, regulation, or product), 
such as is useful in genetic screening or genetic counseling. In a fifth aspect, the assays of the 
invention are used to detect the presence of anti-hTRT antibodies (e.g., in patient serum). The
methods described below in some detail are indicative of useful assays that can be carried out 
using the sequences and relationships disclosed herein. However, numerous variations or 
other applications of these assays will be apparent to those of ordinary skill in the art in view 
of this disclosure. 

It will be recognized that, although the assays below are presented in terms of 
diagnostic and prognostic methods, they may be used whenever an hTRT gene, gene product, 
or variant is to be detected, quantified, or characterized. Thus, for example, the "diagnostic" 
methods described infra are useful for assays of hTRT or telomerase during production and 
purification of hTRT or human telomerase, for characterization of cell lines derived from 
human cells (e.g., to identify immortal lines), for characterization of cells, non-human 
animals, plants, fungi, bacteria or other organisms that comprise a human TRT gene or gene 
product (or fragments thereof). 

As used herein, the term "diagnostic" has its usual meaning of identifying the 
presence or nature of a disease (e.g., cancer), condition (e.g., infertile, activated), or status 
(e.g., fertile), and the term "prognostic" has its usual meaning of predicting the probable 
development and/or outcome of a disease or condition. Although these two terms are used in 
somewhat different ways in a clinical setting, it will be understood that any of the assays or 
assay formats disclosed below in reference to "diagnosis" are equally suitable for
determination of prognosis because it is well established that higher telomerase activity levels 
are associated with poorer prognoses for cancer patients, and because the present invention 
provides detection methods specific for hTRT, which is expressed at levels that closely 
correlate with telomerase activity in a cell.

2) DIAGNOSIS AND PROGNOSIS OF CANCER 

The determination of an hTRT gene, mRNA or protein level above normal or 
standard range is indicative of the presence of telomerase-positive or immortal cells, of
which certain tumor cells are examples. Because certain embryonic and fetal cells, as well as
certain adult stem cells, express telomerase, the present invention also provides methods for 
determining other conditions, such as pregnancy, by the detection or isolation of telomerase 
positive fetal cells from maternal blood. These values can be used to make, or aid in making, 
a diagnosis, even when the cells would not have been classified as cancerous or otherwise
detected or classified using traditional methods. Thus, the methods of the present invention
permit detection or verification of cancerous or other conditions associated with telomerase 
with increased confidence, and at least in some instances at an earlier stage. The assays of the 
invention allow discrimination between different classes and grades of human tumors or other 
cell-proliferative diseases by providing quantitative assays for the hTRT gene and gene
products and thereby facilitate the selection of appropriate treatment regimens and accurate
diagnoses. Moreover, because levels of telomerase activity can be used to distinguish 
between benign and malignant tumors (e.g., U.S. Patent No. 5,489,508; Hiyama et al., 1997,
Proc. Am. Ass. Cancer Res. 38:637), to predict imminence of invasion (e.g., U.S. Patent
No. 5,639,613; Yashima et al., 1997, Proc. Am. Ass. Cancer Res. 38:326), and to correlate
with metastatic potential (e.g., U.S. Patent No. 5,648,215; Pandita et al., 1996, Proc. Am. Ass.
Cancer Res. 37:559), these assays will be useful for prophylaxis, detection, and treatment of a 
wide variety of human cancers. 

For prognosis of cancers (or other diseases or conditions characterized by 
elevated telomerase), a prognostic value of hTRT gene product (mRNA or protein) or activity 
for a particular tumor type, class or grade, is determined as described infra. hTRT protein or
mRNA levels or telomerase activity in a patient can also be determined (e.g., using the assays 
disclosed herein) and compared to the prognostic level.

Depending on the assay used, in some cases the abundance of an hTRT gene 
product in a sample will be considered elevated whenever it is detectable by the assay. Due 
to the low abundance of hTRT mRNA and protein even in telomerase-positive cells, and the 
rarity or non-existence of these gene products in normal or telomerase-negative cells, 
sensitive assays are required to detect the hTRT gene product if present at all in normal cells. 
If less sensitive assays are selected, hTRT gene products will be undetectable in healthy 
tissue but will be detectable in telomerase-positive cancer or other telomerase-positive cells. 
Typically, the amount of hTRT gene product in an elevated sample is at least about five, 
frequently at least about ten, more often at least about 50, and very often at least about 100 to 
1000 times higher than the levels in telomerase-negative control cells or cells from healthy 
tissues in an adult, where the percentage of telomerase-positive normal cells is very low. 
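
By way of illustration only, the fold comparison described in the preceding paragraph can be sketched as follows. The function names, the tier labels, and the choice of Python are assumptions made for this example and are not part of the disclosure; the numeric tiers (about 5, 10, 50, and 100 or more) are taken from the text, and the levels may be expressed in any consistent unit.

```python
# Illustrative sketch only: names and tier labels are hypothetical.

def fold_elevation(sample_level: float, control_level: float) -> float:
    """Return the sample level expressed as a multiple of the control level."""
    if control_level <= 0:
        # With an undetectable control there is no finite ratio; in that
        # situation the text treats any detectable signal as elevated.
        raise ValueError("control_level must be positive")
    return sample_level / control_level


def elevation_tier(fold: float) -> str:
    """Map a fold elevation onto the approximate tiers named in the text."""
    tiers = [
        (100.0, "at least about 100-fold"),
        (50.0, "at least about 50-fold"),
        (10.0, "at least about 10-fold"),
        (5.0, "at least about 5-fold"),
    ]
    for threshold, label in tiers:  # checked from the highest tier down
        if fold >= threshold:
            return label
    return "below about 5-fold"
```

For example, a sample level of 250 measured against a control level of 5 (same units) gives a 50-fold elevation, which falls in the "at least about 50-fold" tier.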

The diagnostic and prognostic methods of the present invention can be 
employed with any cell or tissue type of any origin and can be used to detect an immortal or 
neoplastic cell, or tumor tissue, or cancer, of any origin. Types of cancer that may be 
detected include, but are not limited to, all those listed supra in the discussion of therapeutic
applications of hTRT.

The assays of the invention are also useful for monitoring the efficacy of 
therapeutic intervention in patients being treated with anticancer regimens. Anticancer 
regimens that can be monitored include all presently approved treatments (including 
chemotherapy, radiation therapy, and surgery) and also include treatments to be approved in
the future, such as telomerase inhibition or activation therapies as described herein. (See,
e.g., PCT Publication Nos. 96/01835 and 96/40868 and U.S. Patent No. 5,583,016; all of
which are incorporated by reference in their entirety). 

In another aspect, the assays described below are useful for detecting certain 
variations in hTRT gene sequence (mutations and heritable hTRT alleles) that are indicative 
of a predilection for cancers or other conditions associated with abnormal regulation of 
telomerase activity (infertility, premature aging). 

3) DIAGNOSIS OF CONDITIONS OTHER THAN CANCER 

In addition to diagnosis of cancers, the assays of the present invention have 
numerous other applications. The present invention provides reagents and methods for diagnosis
of conditions or diseases characterized by under- or over-expression of telomerase or hTRT 
gene products in cells. In adults, a low level of telomerase activity is normally found in a 
limited complement of normal human somatic cells, e.g., stem cells, activated lymphocytes
and germ cells, and is absent from other somatic cells. Thus, the detection of hTRT or
telomerase activity in cells in which it is normally absent or inactive, or detection at abnormal
(i.e., higher or lower than normal) levels in cells in which hTRT is normally present at a low
level (such as stem cells, activated lymphocytes and germ cells), can be diagnostic of a
telomerase-related disease or condition or used to identify or isolate a specific cell type (i.e.,
to isolate stem cells). Examples of such diseases and conditions include: diseases of cell
proliferation, immunological disorders, infertility, diseases of immune cell function, 
pregnancy, fetal abnormalities, premature aging, and others. Moreover, the assays of the 
invention are useful for monitoring the effectiveness of therapeutic intervention (including 
but not limited to drugs that modulate telomerase activity) in a patient or in a cell- or
animal-based assay.

In one aspect, the invention provides assays useful for diagnosing infertility. 
Human germ cells (e.g., spermatogonia cells, their progenitors or descendants) are capable of 
indefinite proliferation and characterized by high telomerase activity. Abnormal or
diminished levels of hTRT gene products, or abnormal hTRT gene products, can result in
inadequate or abnormal
production of spermatozoa, leading to infertility or disorders of reproduction. Accordingly,
the invention provides assays (methods and reagents) for diagnosis and treatment of 
"telomerase-based" reproductive disorders. Similarly, the assays can be used to monitor the 
efficacy of contraceptives (e.g., male contraceptives) that target or indirectly affect sperm 
production (and which would reduce hTRT levels or telomerase activity). 

In another aspect, the invention provides assays for analysis of telomerase and
hTRT levels and function in stem cells, fetal cells, embryonic cells, activated lymphocytes
and hematopoietic stem cells. For example, assays for hTRT gene product detection can be 
used to monitor immune function generally (e.g., by monitoring the prevalence of activated 
lymphocytes or abundance of progenitor stem cells), to identify or select or isolate activated
lymphocytes or stem cells (based on elevated hTRT levels), and to monitor the efficacy of
therapeutic interventions targeting these tissues (e.g., immunosuppressive agents or
therapeutic attempts to expand a stem cell population).

The invention also provides assays useful for identification of anti-telomerase 
and anti-TRT immunoglobulins (found in serum from a patient). The materials and assays 
described herein can be used to identify patients in which such autoimmune antibodies are 
found, permitting diagnosis and treatment of the condition associated with the
immunoglobulins. 

4) MONITORING CELLS IN CULTURE 

The assays described herein are also useful for monitoring the expression of 
hTRT gene products and characterization of hTRT genes in cells ex vivo or in vitro. Because
elevated hTRT levels are characteristic of immortalized cells, the assays of the invention can 
be used, for example, to screen for, or identify, immortalized cells or to identify an agent 
capable of mortalizing immortalized cells by inhibiting hTRT expression or function. For 
example, the assay will be useful for identifying cells immortalized by increased expression 
of hTRT in the cell, e.g., by the expression of a recombinant hTRT or by increased expression
of an endogenously coded hTRT (e.g., by promoter activation). 

Similarly, these assays may be used to monitor hTRT expression in transgenic 
animals or cells (e.g., yeast or human cells containing an hTRT gene). In particular, the 
effects of certain treatments (e.g., application of known or putative telomerase antagonists) on 
the hTRT levels in human and nonhuman cells expressing the hTRT of the invention can be
used for identifying useful drugs and drug candidates (e.g., telomerase activity-modulating 
drugs). 

B) NORMAL, DIAGNOSTIC, AND PROGNOSTIC VALUES 

Assays for the presence or quantity of hTRT gene products may be carried out
and the results interpreted in a variety of ways, depending on the assay format, the nature of
the sample being assayed, and the information sought. For example, the steady state 
abundance of hTRT gene products is so low in most human somatic tissues that they are 
undetectable by certain assays. Moreover, there is generally no telomerase activity in the
cells of these tissues, making verification of activity quite easy. Conversely, hTRT protein
and/or hTRT mRNA or telomerase is sufficiently abundant in other telomerase-positive
tissues, e.g., malignant tumors, so that the same can be detected using the same assays. Even
in those somatic cell types in which low levels of telomerase activity can normally be 
detected (e.g., stem cells, and certain activated hematopoietic system cells), the levels of 
hTRT mRNA and telomerase activity are a small fraction (e.g., estimated at about 1% or less) 
of the levels in immortal cells; thus, immortal and mortal cells may be easily distinguished by
the methods of the present invention. It will be appreciated that, when a "less sensitive" 
assay is used, the mere detection of the hTRT gene product in a biological sample can itself 
be diagnostic, without the requirement for additional analysis. Moreover, although the assays 
described below can be made exquisitely sensitive, they may also, if desired, be made less
sensitive (e.g., through judicious choice of buffers, wash conditions, numbers of rounds of
amplification, reagents, and/or choice of signal amplifiers). Thus, virtually any assay can be 
designed so that it detects hTRT gene products only in biological samples in which they are 
present at a particular concentration, e.g. a higher concentration than in healthy or other 
control tissue. In this case, any detectable level of hTRT mRNA or protein will be considered
elevated in cells from post-natal human somatic tissue (other than hematopoietic cells and
other stem cells). 

In some cases, however, it will be desirable to establish normal or baseline 
values (or ranges) for hTRT gene product expression levels, particularly when very sensitive 
assays capable of detecting very low levels of hTRT gene products that may be present in
normal somatic cells are used. Normal levels of expression or normal expression products
can be determined for any particular population, subpopulation, or group of organisms 
according to standard methods well known to those of skill in the art and employing the 
methods and reagents of the invention. Generally, baseline (normal) levels of hTRT protein 
or hTRT mRNA are determined by quantitating the amount of hTRT protein and/or mRNA in
biological samples (e.g., fluids, cells or tissues) obtained from normal (healthy) subjects, e.g.,
a human subject. For certain samples and purposes, one may desire to quantitate the amount
of hTRT gene product on a per cell, or per tumor cell, basis. To determine the cellularity of a 
sample, one may measure the level of a constitutively expressed gene product or other gene 
product expressed at known levels in cells of the type from which the sample was taken.
Alternatively, normal values of hTRT protein or hTRT mRNA can be determined by
quantitating the amount of hTRT protein/RNA in cells or tissues known to be healthy, which
are obtained from the same patient from whom diseased (or possibly diseased) cells are
collected or from a healthy individual. Alternatively, baseline levels can be defined in some 
cases as the level present in non-immortal human somatic cells in culture. It is possible that 
normal (baseline) values may differ somewhat between different cell types (for example, 
hTRT mRNA levels will be higher in testis than kidney), or according to the age, sex, or
physical condition of a patient. Thus, for example, when an assay is used to determine 
changes in hTRT levels associated with cancer, the cells used to determine the normal range 
of hTRT gene product expression can be cells from persons of the same or a different age, 
depending on the nature of the inquiry. Application of standard statistical methods used in 
molecular genetics permits determination of baseline levels of expression, as well as
identification of significant deviations from such baseline levels. 
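
As a non-limiting illustration, the per-cell normalization and baseline determination described above might be sketched as follows. All names are hypothetical. The use of the mean plus or minus two sample standard deviations as the "normal range" is an assumption chosen for this example; the text refers only to standard statistical methods and does not prescribe a particular test.

```python
# Illustrative sketch only: names and the mean +/- k*SD rule are assumptions.
from statistics import mean, stdev


def per_cell_level(htrt_signal: float, reference_signal: float) -> float:
    """Normalize an hTRT measurement to a constitutively expressed gene product.

    The reference signal stands in for the cellularity of the sample.
    """
    if reference_signal <= 0:
        raise ValueError("reference_signal must be positive")
    return htrt_signal / reference_signal


def baseline_range(normal_levels: list[float], k: float = 2.0) -> tuple[float, float]:
    """Return (low, high) bounds from levels measured in healthy samples."""
    if len(normal_levels) < 2:
        raise ValueError("at least two normal samples are needed")
    centre = mean(normal_levels)
    spread = stdev(normal_levels)  # sample standard deviation
    return (centre - k * spread, centre + k * spread)


def deviates_from_baseline(level: float, baseline: tuple[float, float]) -> bool:
    """True when a normalized level falls outside the baseline range."""
    low, high = baseline
    return level < low or level > high
```

In use, each healthy sample would first be normalized with `per_cell_level`, the normalized values passed to `baseline_range`, and a patient sample normalized in the same way before being tested with `deviates_from_baseline`. As the text notes, the baseline may need to be established separately for each cell type, age group, or other relevant subpopulation.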

In carrying out the diagnostic and prognostic methods of the invention, as 
described above, it will sometimes be useful to refer to "diagnostic" and "prognostic values." 
As used herein, "diagnostic value" refers to a value that is determined for the hTRT gene 
product detected in a sample which, when compared to a normal (or "baseline") range of the
hTRT gene product, is indicative of the presence of a disease. The disease may be
characterized by high telomerase activity (e.g., cancer), the absence of telomerase activity 
(e.g., infertility), or some intermediate value. "Prognostic value" refers to an amount of the 
hTRT gene product detected in a given cell type (e.g., malignant tumor cell) that is consistent 
with a particular diagnosis and prognosis for the disease (e.g., cancer). The amount
(including a zero amount) of the hTRT gene product detected in a sample is compared to the
prognostic value for the cell such that the relative comparison of the values indicates the 
presence of disease or the likely outcome of the disease (e.g., cancer) progression. In one 
embodiment, for example, to assess tumor prognosis, data are collected to obtain a 
statistically significant correlation of hTRT levels with different tumor classes or grades. A
predetermined range of hTRT levels is established for the same cell or tissue sample obtained 
from subjects having known clinical outcomes. A sufficient number of measurements is 
made to produce a statistically significant value (or range of values) to which a comparison 
will be made. The predetermined range of hTRT levels or activity for a given cell or tissue 
30 sample can then be used to determine a value or range for the level of hTRT gene product that 
would correlate to favorable (or less unfavorable) prognosis (e.g., a "low level" in the case of 
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cancer). A range corresponding to a "high level" correlated to an (or a more) unfavorable 
prognosis in the case of cancer can similarly be determined. The level of hTRT gene product 
from a biological sample (e.g., a patient sample) can then be determined and compared to the 
low and high ranges and used to predict a clinical outcome. 
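By way of illustration only, the comparison of a measured hTRT level to predetermined "low" and "high" ranges can be sketched in Python as follows. The function names, the use of the mean plus or minus two sample standard deviations to define a range, and every numerical value are assumptions made for this sketch; they are not taken from the disclosure, and other statistical methods may be preferred.

```python
from statistics import mean, stdev
from typing import Sequence, Tuple

Range = Tuple[float, float]


def reference_range(levels: Sequence[float], k: float = 2.0) -> Range:
    """Return (lower, upper) as mean +/- k sample standard deviations.

    Assumption: a symmetric mean +/- k*SD interval is only one of many
    ways to derive a range. The lower bound is clamped at zero because
    an amount of gene product cannot be negative.
    """
    if len(levels) < 2:
        raise ValueError("at least two measurements are needed")
    m, s = mean(levels), stdev(levels)
    return (max(0.0, m - k * s), m + k * s)


def classify_level(level: float, low: Range, high: Range) -> str:
    """Place a measured level in the 'low' or 'high' range, if either applies.

    If the two ranges overlap, the 'low' range is checked first.
    """
    if low[0] <= level <= low[1]:
        return "low"
    if high[0] <= level <= high[1]:
        return "high"
    return "indeterminate"
```

In use, `reference_range` would be applied separately to measurements from subjects with favorable and with unfavorable known outcomes, and `classify_level` would then be applied to the level measured in a patient sample.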

Although the discussion above refers to cancer for illustration, it will be
understood that diagnostic and prognostic values can also be determined for other diseases
(e.g., diseases of cell proliferation) and conditions and that, for diseases or conditions other
than cancer, a "high" level may be correlated with the desired outcome and a "low" level
correlated with an unfavorable outcome. For example, some diseases may be characterized
by a deficiency (e.g., low level) of telomerase activity in stem cells, activated lymphocytes, or
germline cells. In such cases, "high" levels of hTRT gene products relative to cells of similar
age and/or type (e.g., from other patients or other tissues in a particular patient) may be
correlated with a favorable outcome.

It will be appreciated that the assay methods do not necessarily require
measurement of absolute values of hTRT, unless it is so desired, because relative values are
sufficient for many applications of the methods of the present invention. Where quantitation
is desirable, the present invention provides reagents such that virtually any known method for
quantitating gene products can be used.

The assays of the invention may also be used to evaluate the efficacy of a
particular therapeutic treatment regime in animal studies, in clinical trials, or in monitoring
the treatment of an individual patient. In these cases, it may be desirable to establish the
baseline for the patient prior to commencing therapy and to repeat the assays one or more
times through the course of treatment, usually on a regular basis, to evaluate whether hTRT
levels are moving toward the desired endpoint (e.g., reduced expression of hTRT when the
assay is for cancer) as a result of the treatment.

One of skill will appreciate that, in addition to the quantity or abundance of
hTRT gene products, variant or abnormal expression patterns (e.g., abnormal amounts of
RNA splicing variants) or variant or abnormal expression products (e.g., mutated transcripts,
truncated or non-sense polypeptides) may also be identified by comparison to normal
expression levels and normal expression products. In these cases determination of "normal"
or "baseline" involves identifying healthy organisms and/or tissues (i.e., organisms and/or
tissues without hTRT expression dysregulation or neoplastic growth) and measuring
expression levels of the variant hTRT gene products (e.g., splicing variants), or sequencing or
detecting the hTRT gene, mRNA, or reverse transcribed cDNA to obtain or detect typical
(normal) sequence variations. Application of standard statistical methods used in molecular
genetics permits determination of significant deviations from such baseline levels.

C) DETECTION AND QUANTITATION OF TRT GENE PRODUCTS 

As has been emphasized herein, hTRT gene products are usually found in
most normal somatic cells at extremely low levels. For example, the mRNA encoding hTRT
protein is extremely rare or absent in all telomerase-negative cell types studied thus far. In
immortal cells, such as 293 cells, hTRT mRNA may be present at only about 100 copies per
cell, while normal somatic cells may have as few as one or zero copies per cell. It will thus
be apparent that, when highly sensitive assays for hTRT gene products are desired, it will
sometimes be advantageous to incorporate signal or target amplification technologies into the
assay format. See, for example, Plenat et al., 1997, Ann. Pathol. 17:17
(fluoresceinyl-tyramide signal amplification); Zehbe et al., 1997, J. Pathol. 150:1553
(catalyzed reporter deposition); other references listed herein (e.g., for bDNA signal
amplification, for PCR and other target amplification formats); and other techniques known
in the art.

As noted above, it is often unnecessary to quantitate the hTRT mRNA or
protein in the assays disclosed herein, because the detection of an hTRT gene product (under
assay conditions in which the product is not detectable in control, e.g., telomerase-negative
cells) is in itself sufficient for a diagnosis. As another example, when the levels of product
found in a test (e.g., tumor) and control (e.g., healthy cell) samples are directly compared,
quantitation may be superfluous.

When desired, however, quantities of hTRT gene product measured in the
assays described herein may be described in a variety of ways, depending on the method of
measurement and convenience. Thus, normal, diagnostic, prognostic, high or low quantities
of hTRT protein/mRNA may be expressed as standard units of weight per quantity of
biological sample (e.g., picograms per gram tissue, picograms per 10^12 cells), as a number of
molecules per quantity of biological sample (e.g., transcripts/cell, moles/cell), as units of
activity per cell or per other unit quantity, or by similar methods. The quantity of hTRT gene
product can also be expressed in relation to the quantity of another molecule; examples
include: number of hTRT transcripts in sample/number of 28S rRNA transcripts in sample;
nanograms of hTRT protein/nanograms of total protein; and the like.

When measuring hTRT gene products in two (or more) different samples, it 
will sometimes be useful to have a common basis of comparison for the two samples. For 
example, when comparing a sample of normal tissue and a sample of cancerous tissue, equal 
amounts of tissue (by weight, volume, number of cells, etc.) can be compared. Alternatively, 
equivalents of a marker molecule (e.g., 28S rRNA, hTR, telomerase activity, telomere length, 
actin) may be used. For example, the amount of hTRT protein in a healthy tissue sample 
containing 10 picograms of 28S rRNA can be compared to a sample of diseased tissue 
containing the same amount of 28S rRNA.
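The use of a marker molecule as a common basis of comparison can be sketched as follows. This is a minimal Python illustration; the function names and all amounts are hypothetical, and the sketch assumes that the hTRT amount and the marker amount (e.g., 28S rRNA) were measured in the same sample and in consistent units.

```python
from typing import Tuple


def normalized_level(htrt_amount: float, marker_amount: float) -> float:
    """Express an hTRT amount per unit of a marker molecule (e.g., 28S rRNA)."""
    if marker_amount <= 0:
        raise ValueError("marker amount must be positive")
    return htrt_amount / marker_amount


def fold_difference(test: Tuple[float, float], control: Tuple[float, float]) -> float:
    """Ratio of marker-normalized hTRT levels, test sample over control sample.

    Each argument is a hypothetical (htrt_amount, marker_amount) pair.
    """
    control_level = normalized_level(*control)
    if control_level == 0:
        raise ValueError("control level is zero; a fold difference is undefined")
    return normalized_level(*test) / control_level
```

When both samples contain the same amount of marker (for instance, 10 picograms of 28S rRNA each), the fold difference reduces to the simple ratio of the two hTRT amounts.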

It will also be recognized by those of skill that virtually any of the assays 
described herein can be designed to be quantitative. Typically, a known quantity or source of 
an hTRT gene product (e.g., produced using the methods and compositions of the invention) 
is used to calibrate the assay. 
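One way such a calibration might be carried out is sketched below in Python: a straight line is fitted by least squares to the signals obtained from known quantities of an hTRT gene product, and the fitted line is then used to convert the signal from a test sample into a quantity. The linear signal model, the function names, and the example numbers are assumptions of this sketch only; a given assay may require a different model (e.g., a log-linear one).

```python
from typing import Sequence, Tuple


def fit_standard_curve(quantities: Sequence[float],
                       signals: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of signal = slope * quantity + intercept."""
    n = len(quantities)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two paired standards")
    mean_q = sum(quantities) / n
    mean_s = sum(signals) / n
    sqq = sum((q - mean_q) ** 2 for q in quantities)
    if sqq == 0:
        raise ValueError("standards must span more than one quantity")
    sqs = sum((q - mean_q) * (s - mean_s) for q, s in zip(quantities, signals))
    slope = sqs / sqq
    return slope, mean_s - slope * mean_q


def quantity_from_signal(signal: float, slope: float, intercept: float) -> float:
    """Invert the fitted line to estimate the quantity in a test sample."""
    if slope == 0:
        raise ValueError("a flat standard curve cannot be inverted")
    return (signal - intercept) / slope
```

The estimate is meaningful only for signals that fall within the range spanned by the standards.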

In certain embodiments, assay formats are chosen that detect the presence, 
absence, or abundance of an hTRT allele or gene product in each cell in a sample (or in a 
representative sampling). Examples of such formats include those that detect a signal by 
histology (e.g., immunohistochemistry with signal-enhancing or target-enhancing 
amplification steps) or fluorescence-activated cell analysis or cell sorting (FACS). These 
formats are particularly advantageous when dealing with a highly heterogeneous cell 
population (e.g., containing multiple cell types in which only one or a few types have
elevated hTRT levels, or a population of similar cells expressing telomerase at different 
levels). 

D) SAMPLE COLLECTION 

The hTRT gene or gene product (i.e., mRNA or polypeptide) is preferably 
detected and/or quantified in a biological sample. Such samples include, but are not limited 
to, cells (including whole cells, cell fractions, cell extracts, and cultured cells or cell lines), 
tissues (including blood, blood cells (e.g., white cells), and tissue samples such as fine needle
biopsy samples (e.g., from prostate, breast, thyroid, etc.)), body fluids (e.g., urine, sputum,
amniotic fluid, blood, peritoneal fluid, pleural fluid, semen) or cells collected therefrom (e.g.,
bladder cells from urine, lymphocytes from blood), media (from cultured cells or cell lines),
and washes (e.g., of bladder and lung). Biological samples may also include sections of
tissues such as frozen sections taken for histological purposes. For cancer diagnosis and
prognosis, a sample will be obtained from a cancerous or precancerous or suspected 
cancerous tissue or tumor. It will sometimes be desirable to freeze a biological sample for 
later analysis (e.g., when monitoring efficacy of drug treatments). 

In some cases, the cells or tissues may be fractionated before analysis. For
example, in a tissue biopsy from a patient, a cell sorter (e.g., a fluorescence-activated cell
sorter) may be used to sort cells according to characteristics such as expression of a surface
antigen (e.g., a tumor specific antigen) according to well known methods.

Although the sample is typically taken from a human patient or cell line, the
assays can be used to detect hTRT homolog genes or gene products in samples from other
animals. Alternatively, hTRT genes and gene products can be assayed in transgenic animals
or organisms expressing a human TRT protein or nucleic acid sequence.

The sample may be pretreated as necessary by dilution in an appropriate
buffer solution or by concentration, if desired. Any of a number of standard aqueous buffer
solutions, employing one of a variety of buffers, such as phosphate, Tris-buffer, or the like, at
physiological pH can be used.

A "biological sample" obtained from a patient can be referred to either as a
"biological sample" or a "patient sample." It will be appreciated that analysis of a "patient
sample" need not necessarily require removal of cells or tissue from the patient. For
example, appropriately labeled hTRT-binding agents (e.g., antibodies or nucleic acids) can be
injected into a patient and visualized (when bound to the target) using standard imaging
technology (e.g., CAT, NMR, and the like).

E) NUCLEIC ACID ASSAYS 

In one embodiment, this invention provides for methods of detecting and/or
quantifying expression of hTRT mRNAs (including splicing or sequence variants and
alternative alleles). In an alternative embodiment, the invention provides methods for
detecting and analyzing normal or abnormal hTRT genes (or fragments thereof). The form of
such qualitative or quantitative assays may include, but is not limited to, amplification-based
assays with or without signal amplification, hybridization-based assays, and combination
amplification-hybridization assays. It will be appreciated by those of skill that the distinction
between hybridization and amplification is for convenience only: as illustrated in the
examples below, many assay formats involve elements of both hybridization and
amplification, so that the categorization is somewhat arbitrary in some cases.

1) PREPARATION OF NUCLEIC ACIDS 

In some embodiments, nucleic acid assays are performed with a sample of
nucleic acid isolated from the cell, tissue, organism, or cell line to be tested. The nucleic acid
(e.g., genomic DNA, RNA or cDNA) may be "isolated" from the sample according to any of
a number of methods well known to those of skill in the art. In this context, "isolated" refers
to any separation of the species or target to be detected from any other substance in the
mixture, but does not necessarily indicate a significant degree of purification of the target.
One of skill will appreciate that, where alterations in the copy number of the hTRT gene are
to be detected, genomic DNA is the target to be detected. Conversely, where expression
levels of a gene or genes are to be detected, RNA is the target to be detected in a nucleic
acid-based assay. In one preferred embodiment, the nucleic acid sample is the total mRNA
(i.e., poly(A)+ RNA) in a biological sample. Methods for isolating nucleic acids are well
known to those of skill in the art and are described, for example, in Tijssen, P., ed.,
Laboratory Techniques in Biochemistry and Molecular Biology: Hybridization
With Nucleic Acid Probes, Part I. Theory and Nucleic Acid Preparation, Elsevier,
N.Y. (1993) Chapt. 3, which is incorporated herein by reference. In one embodiment, the
total nucleic acid is isolated from a given sample using an acid guanidinium-phenol-
chloroform extraction method and poly(A)+ mRNA is isolated by oligo-dT column
chromatography or by using (dT)n magnetic beads (see, e.g., Sambrook et al., and Ausubel et
al., supra).

In alternative embodiments, it is not necessary to isolate nucleic acids (e.g.,
total or poly(A)+ RNA) from the biological sample prior to carrying out amplification,
hybridization or other assays. These embodiments have certain advantages when hTRT RNA
is to be measured, because they reduce the possibility of loss of hTRT mRNA during
isolation and handling. For example, many amplification techniques such as PCR and
RT-PCR defined above can be carried out using permeabilized cells (histological specimens and
FACS analyses), whole lysed cells, or crude cell fractions such as certain cell extracts.
Preferably, steps are taken to preserve the integrity of the target nucleic acid (e.g., mRNA) if
necessary (e.g., addition of RNase inhibitors). Amplification and hybridization assays can
also be carried out in situ, for example, in thin tissue sections from a biopsy sample or from a
cell monolayer (e.g., blood cells or disaggregated tissue culture cells). Amplification can also
be carried out in an intact whole cell or fixed cells. For example, PCR, RT-PCR, or LCR
amplification methods may be carried out, as is well known in the art, in situ, e.g., using a
polymerase or ligase, a primer or primer(s), and (deoxy)ribonucleoside triphosphates (if a
polymerase is employed), and reverse transcriptase and primer (if RNA is to be transcribed
and the cDNA is to be detected) on fixed, permeabilized, or microinjected cells to amplify
target hTRT RNA or DNA. Cells containing hTRT RNA (e.g., telomerase positive cells) or
an hTRT DNA sequence of interest can then be detected. This method is often useful when
fluorescently-labeled dNTPs, primers, or other components are used in conjunction with
microscopy, FACS analysis or the equivalent.



2) AMPLIFICATION BASED ASSAYS 
In one embodiment, the assays of the present invention are
amplification-based assays for detection of an hTRT gene or gene product. In an
amplification based assay, all or part of an hTRT gene or transcript (e.g., mRNA or cDNA;
hereinafter also referred to as "target") is amplified, and the amplification product is then
detected directly or indirectly. When there is no underlying gene or gene product to act as a
template, no amplification product is produced (e.g., of the expected size), or amplification is
non-specific and typically there is no single amplification product. In contrast, when the
underlying gene or gene product is present, the target sequence is amplified, providing an
indication of the presence and/or quantity of the underlying gene or mRNA. Target
amplification-based assays are well known to those of skill in the art.

The present invention provides a wide variety of primers and probes for
detecting hTRT genes and gene products. Such primers and probes are sufficiently
complementary to the hTRT gene or gene product to hybridize to the target nucleic acid.
Primers are typically at least 6 bases in length, usually between about 10 and about 100 bases,
typically between about 12 and about 50 bases, and often between about 14 and about 25
bases in length. One of skill, having reviewed the present disclosure, will be able, using
routine methods, to select primers to amplify all, or any portion, of the hTRT gene or gene
product, or to distinguish between variant gene products, hTRT alleles, and the like. Table 2
lists illustrative primers useful for PCR amplification of the hTRT, or specific hTRT gene
products or regions. As is known in the art, single oligomers (e.g., U.S. Pat. No. 5,545,522),
nested sets of oligomers, or even a degenerate pool of oligomers may be employed for
amplification, e.g., as illustrated by the amplification of the Tetrahymena TRT cDNA as
described infra.

The invention provides a variety of methods for amplifying and detecting an
hTRT gene or gene product, including the polymerase chain reaction (including all variants,
e.g., reverse-transcriptase-PCR; the Sunrise Amplification System (Oncor, Inc., Gaithersburg
MD); and numerous others known in the art). In one illustrative embodiment, PCR
amplification is carried out in a 50 µl solution containing the nucleic acid sample (e.g., cDNA
obtained through reverse transcription of hTRT RNA), 100 µM in each dNTP (dATP, dCTP,
dGTP and dTTP; Pharmacia LKB Biotechnology, NJ), the hTRT-specific PCR primer(s), 1
unit of Taq polymerase (Perkin Elmer, Norwalk CT), 1x PCR buffer (50 mM KCl, 10 mM Tris,
pH 8.3 at room temperature, 1.5 mM MgCl2, 0.01% gelatin) with the amplification run for
about 30 cycles at 94°C for 45 sec, 55°C for 45 sec and 72°C for 90 sec. However, as will be
appreciated, numerous variations may be made to optimize the PCR amplification for any
particular reaction.
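The arithmetic implied by the illustrative reaction can be sketched as follows. The sketch takes the cycling program as 30 cycles of 94°C for 45 sec, 55°C for 45 sec, and 72°C for 90 sec, and reads the reaction volume as 50 µl and the dNTP concentration as 100 µM each; the step names and function names are assumptions of this sketch, and ramp times between temperatures are ignored.

```python
from typing import List, Tuple

# Cycling program as read from the illustrative embodiment.
CYCLES = 30
STEPS: List[Tuple[str, int, int]] = [
    ("denature", 94, 45),  # (name, temperature in deg C, hold time in s)
    ("anneal", 55, 45),
    ("extend", 72, 90),
]


def total_hold_seconds(cycles: int, steps: List[Tuple[str, int, int]]) -> int:
    """Sum of programmed hold times; ramp times between steps are ignored."""
    return cycles * sum(seconds for _name, _temp_c, seconds in steps)


def amount_nmol(concentration_um: float, volume_ul: float) -> float:
    """Amount of solute in nmol: uM x ul gives pmol, and 1000 pmol = 1 nmol."""
    return concentration_um * volume_ul / 1000.0
```

Under these assumptions the programmed hold time is 30 x 180 sec = 5400 sec (90 minutes), and each dNTP is present at 5 nmol per reaction.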

Other suitable target amplification methods include the ligase chain reaction
(LCR; e.g., Wu and Wallace, 1989, Genomics 4:560; Landegren et al., 1988, Science 241:1077;
Barany, 1991, Proc. Natl. Acad. Sci. USA 88:189; and Barringer et al., 1990, Gene 89:117);
strand displacement amplification (SDA; e.g., Walker et al., 1992, Proc. Natl. Acad.
Sci. USA 89:392-396); transcription amplification (e.g., Kwoh et al., 1989, Proc. Natl.
Acad. Sci. USA 86:1173); self-sustained sequence replication (3SR; e.g., Fahy et al., 1992,
PCR Methods Appl. 1:25; Guatelli et al., 1990, Proc. Natl. Acad. Sci. USA 87:1874); the
nucleic acid sequence based amplification (NASBA, Cangene, Mississauga, Ontario; e.g.,
Compton, 1991, Nature 350:91); the transcription-based amplification system (TAS); and the
self-sustained sequence replication system (SSR). Each of the aforementioned publications is
incorporated herein by reference. One useful variant of PCR is PCR ELISA (e.g., Boehringer
Mannheim Cat. No. 1 636 111) in which digoxigenin-dUTP is incorporated into the PCR
product. The PCR reaction mixture is denatured and hybridized with a biotin-labeled
oligonucleotide designed to anneal to an internal sequence of the PCR product. The
hybridization products are immobilized on streptavidin coated plates and detected using
anti-digoxigenin antibodies. Examples of techniques sufficient to direct persons of skill through
in vitro amplification methods are found in PCR Technology: Principles and
Applications for DNA Amplification, H. Erlich, Ed. Freeman Press, New York, NY
(1992); PCR Protocols: A Guide to Methods and Applications, eds. Innis, Gelfand,
Sninsky, and White, Academic Press, San Diego, CA (1990); Mattila et al., 1991, Nucleic
Acids Res. 19:4967; Eckert and Kunkel, 1991, PCR Methods and Applications 1:17;
PCR, eds. McPherson, Quirke, and Taylor, IRL Press, Oxford; U.S. Patent Nos. 4,683,195,
4,683,202, and 4,965,188; Barringer et al., 1990, Gene 89:117; Lomeli et al., 1989, J. Clin.
Chem. 35:1826, each of which is incorporated herein for all purposes.

Amplified products may be directly analyzed, e.g., by size as determined by
gel electrophoresis; by hybridization to a target nucleic acid immobilized on a solid support
such as a bead, membrane, slide, or chip; by sequencing; immunologically, e.g., by
PCR-ELISA; by detection of a fluorescent, phosphorescent, or radioactive signal; or by any of a
variety of other well-known means. For example, one illustrative detection
method uses PCR primers augmented with hairpin loops linked to fluorescein and a benzoic
acid derivative that serves as a quencher, such that fluorescence is emitted only when the
primers unfold to bind their targets and replication occurs.

Because hTRT mRNA is typically expressed as an extremely rare transcript,
present at very low levels even in telomerase positive cells, it is often desirable to optimize or
increase the signal resulting from the amplification step. One way to do this is to increase the
number of cycles of amplification. For example, although 20-25 cycles are adequate for
amplification of most mRNAs using the polymerase chain reaction under standard reaction
conditions, detection of hTRT mRNA in many samples can require as many as 30 to 35
cycles of amplification, depending on detection format and efficiency of amplification. It
will be recognized that judicious choice of the amplification conditions including the number
of amplification cycles can be used to design an assay that results in an amplification product
only when there is a threshold amount of target in the test sample (i.e., so that only samples
with a high level of hTRT mRNA give a "positive" result). In addition, methods are known
to increase signal produced by amplification of the target sequence. Methods for augmenting
the ability to detect the amplified target include signal amplification systems such as: branched
DNA signal amplification (e.g., U.S. Pat. No. 5,124,246; Urdea, 1994, Bio/Tech 12:926);
tyramide signal amplification (TSA) system (Du Pont); catalytic signal amplification (CSA;
Dako); Q Beta Replicase systems (Tyagi et al., 1996, Proc. Natl. Acad. Sci. USA 93:5395);
or the like.
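The relationship between starting copy number and the number of cycles required can be illustrated with the idealized exponential model, in which the number of copies after n cycles is N0 x (1 + E)^n, where N0 is the starting copy number and E is the per-cycle efficiency (E = 1 for perfect doubling). The Python sketch below is an idealization under stated assumptions: the detection threshold of 10^9 copies and the starting copy numbers are hypothetical, and real reactions plateau and rarely sustain perfect efficiency.

```python
def cycles_to_threshold(start_copies: float,
                        threshold_copies: float,
                        efficiency: float = 1.0) -> int:
    """Smallest cycle count n with start_copies * (1 + efficiency)**n >= threshold.

    Idealized model: constant per-cycle efficiency and no plateau.
    """
    if start_copies <= 0 or threshold_copies <= 0:
        raise ValueError("copy numbers must be positive")
    if not 0 < efficiency <= 1:
        raise ValueError("efficiency must be in (0, 1]")
    copies = float(start_copies)
    cycles = 0
    while copies < threshold_copies:
        copies *= 1.0 + efficiency  # one round of amplification
        cycles += 1
    return cycles


# Hypothetical detection threshold, in product copies, for this sketch only.
DETECTION_THRESHOLD = 1e9
```

With perfect doubling, a template present at 1000 starting copies reaches this hypothetical threshold in 20 cycles, whereas a template present at a single copy requires 30 cycles. A thousand-fold difference in abundance thus corresponds to roughly ten additional cycles, which is in keeping with the difference between 20-25 and 30-35 cycles noted above.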

One of skill in the art will appreciate that whatever amplification method is
used, a variety of quantitative methods known in the art can be used if quantitation is desired.
For example, when desired, two or more polynucleotides can be co-amplified in a single
sample. This method can be used as a convenient method of quantitating the amount of
hTRT mRNA in a sample, because the reverse transcription and amplification reactions are
carried out in the same reaction for a target and control polynucleotide. The co-amplification
of the control polynucleotide (usually present at a known concentration or copy number) can
be used for normalization to the cell number in the sample as compared to the amount of
hTRT in the sample. Suitable control polynucleotides for co-amplification reactions include
DNA, RNA expressed from housekeeping genes, constitutively expressed genes, and in vitro
synthesized RNAs or DNAs added to the reaction mixture. Endogenous control
polynucleotides are those that are already present in the sample, while exogenous control
polynucleotides are added to a sample, creating a "spiked" reaction. Illustrative control
RNAs include β-actin RNA, GAPDH RNA, snRNAs, hTR, and endogenously expressed 28S
rRNA (see Khan et al., 1992, Neurosci. Lett. 147:114). Exogenous control polynucleotides
include a synthetic AW106 cRNA, which may be synthesized as a sense strand from
pAW106 by T7 polymerase. It will be appreciated that for the co-amplification method to be
useful for quantitation, the control and target polynucleotides must typically both be
amplified in a linear range. Detailed protocols for quantitative PCR may be found in PCR
Protocols, A Guide to Methods and Applications, Innis et al., Academic Press, Inc.
N.Y., (1990) and Ausubel et al., supra (Unit 15) and Diaco, R. (1995) Practical
Considerations for the Design of Quantitative PCR Assays, in PCR Strategies, pg. 84-108,
Innis et al. eds, Academic Press, New York.
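A simplified view of quantitation with a "spiked" exogenous control is sketched below in Python. The sketch assumes that the target and the control are amplified with equal efficiency and that both signals are measured within the linear range, so that the ratio of signals reflects the ratio of starting copies; the function names and the numerical values are hypothetical.

```python
def estimate_target_copies(target_signal: float,
                           control_signal: float,
                           control_copies: float) -> float:
    """Estimate starting target copies from a co-amplified control.

    Assumes equal amplification efficiency for target and control and
    measurement of both products in the linear range.
    """
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    if target_signal < 0 or control_copies < 0:
        raise ValueError("signals and copy numbers cannot be negative")
    return target_signal * control_copies / control_signal


def copies_per_cell(total_copies: float, cell_count: int) -> float:
    """Normalize an estimated copy number to the number of cells assayed."""
    if cell_count <= 0:
        raise ValueError("cell count must be positive")
    return total_copies / cell_count
```

For example, if a control spiked at 5000 copies yields a signal of 1000 units and the hTRT product yields 200 units, the estimate is 1000 starting hTRT copies; if the sample contained 10 cells, this corresponds to 100 copies per cell.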

Depending on the sequence of the endogenous or exogenous standard,
different primer sets may be used for the co-amplification reaction. In one method, called
competitive amplification, quantitative PCR involves simultaneously co-amplifying a known
quantity of a control sequence using the same primers used for amplification of the target
nucleic acid (one pair of 2 primers). In an alternative embodiment, known as
non-competitive amplification, the control sequence and the target sequence (e.g., hTRT cDNA) are
amplified using different primers (i.e., 2 pairs of 2 primers). In another alternative
embodiment, called semi-competitive amplification, three primers are used, one of which is
hTRT-specific, one of which is control-specific, and one of which is capable of annealing to
both the target and control sequences. Semi-competitive amplification is described in U.S.
Patent No. 5,629,154, which is incorporated herein by reference.

3) HYBRIDIZATION-BASED ASSAYS

a) GENERALLY 

A variety of methods for specific DNA and RNA measurement using nucleic
acid hybridization techniques are known to those of skill in the art (see Sambrook et al.,
supra). Hybridization-based assays refer to assays in which a probe nucleic acid is hybridized
to a target nucleic acid. Usually the nucleic acid hybridization probes of the invention are
entirely or substantially identical to a contiguous sequence of the hTRT gene or RNA
sequence. Preferably, nucleic acid probes are at least about 10 bases, often at least about 20
bases, and sometimes at least about 200 bases or more in length. Methods of selecting
nucleic acid probe sequences for use in nucleic acid hybridization are discussed in Sambrook
et al., supra. In some formats, at least one of the target and probe is immobilized. The
immobilized nucleic acid may be DNA, RNA, or another oligo- or poly-nucleotide, and may
comprise natural or non-naturally occurring nucleotides, nucleotide analogs, or backbones.
Such assays may be in any of several formats including: Southern, Northern, dot and slot
blots, high-density polynucleotide or oligonucleotide arrays (e.g., GeneChips™, Affymetrix),
dip sticks, pins, chips, or beads. All of these techniques are well known in the art and are the
basis of many commercially available diagnostic kits. Hybridization techniques are generally
described in Hames et al., ed., Nucleic Acid Hybridization, A Practical Approach, IRL
Press, (1985); Gall and Pardue, Proc. Natl. Acad. Sci. U.S.A., 63: 378-383 (1969); and John
et al., Nature, 223: 582-587 (1969).

A variety of nucleic acid hybridization formats are known to those skilled in
the art. For example, one common format is direct hybridization, in which a target nucleic
acid is hybridized to a labeled, complementary probe. Typically, labeled nucleic acids are
used for hybridization, with the label providing the detectable signal. One method for
evaluating the presence, absence, or quantity of hTRT mRNA is carrying out a Northern
transfer of RNA from a sample and hybridization of a labeled hTRT-specific nucleic acid
probe, as illustrated in Example 2. As was noted supra, hTRT mRNA, when present at all, is
present in very low quantities in most cells. Therefore, when Northern hybridization is used,
it will often be desirable to use an amplification step (or, alternatively, large amounts of
starting RNA). A useful method for evaluating the presence, absence, or quantity of DNA
encoding hTRT proteins in a sample involves a Southern transfer of DNA from a sample and
hybridization of a labeled hTRT-specific nucleic acid probe.

Other common hybridization formats include sandwich assays and
competition or displacement assays. Sandwich assays are commercially useful hybridization
assays for detecting or isolating nucleic acid sequences. Such assays utilize a "capture"
nucleic acid covalently immobilized to a solid support and a labeled "signal" nucleic acid in
solution. The biological or clinical sample will provide the target nucleic acid. The "capture"
nucleic acid and "signal" nucleic acid probe hybridize with the target nucleic acid to form a
"sandwich" hybridization complex. To be effective, the signal nucleic acid cannot hybridize
with the capture nucleic acid.
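The requirement that the signal nucleic acid not hybridize with the capture nucleic acid can be screened for, in a crude first pass, by checking candidate probe sequences for stretches of mutual complementarity. The Python sketch below is an illustration only: the minimum run length of 8 bases is an arbitrary assumption, the sequences used are hypothetical, and the check considers only perfectly complementary DNA stretches rather than the thermodynamics of hybridization.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def can_cross_hybridize(capture: str, signal: str, min_run: int = 8) -> bool:
    """True if the signal probe has a perfectly complementary stretch of at
    least min_run bases anywhere in the capture probe.

    min_run is an arbitrary screening cutoff chosen for this sketch.
    """
    if min_run < 1:
        raise ValueError("min_run must be at least 1")
    cap = capture.upper()
    rc = reverse_complement(signal)
    return any(rc[i:i + min_run] in cap
               for i in range(len(rc) - min_run + 1))
```

A probe pair flagged by such a screen would be redesigned or examined further by more rigorous methods before use.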

b) CHIP-BASED AND SLIDE-BASED ASSAYS

The present invention also provides probe-based hybridization assays for
hTRT gene products employing arrays of immobilized oligonucleotides or polynucleotides to
which an hTRT nucleic acid can hybridize (i.e., to some, but usually not all or even most, of
the immobilized oligo- or poly-nucleotides). High density oligonucleotide arrays or
polynucleotide arrays provide a means for efficiently detecting the presence and
characteristics (e.g., sequence) of a target nucleic acid (e.g., hTRT gene, mRNA, or cDNA).
Techniques are known for producing arrays containing thousands of oligonucleotides
complementary to defined sequences, at defined locations on a surface using
photolithographic techniques for synthesis in situ (see, e.g., U.S. Patent Nos. 5,578,832;
5,556,752; and 5,510,270; Fodor et al., 1991, Science 251:767; Pease et al., 1994, Proc. Natl.
Acad. Sci. USA 91:5022; and Lockhart et al., 1996, Nature Biotech. 14:1675) or other methods
for rapid synthesis and deposition of defined oligonucleotides (Blanchard et al., 1996,
Biosensors & Bioelectronics 11:687). When these methods are used, oligonucleotides (e.g.,
20-mers) of known sequence are synthesized directly on a surface such as a derivatized glass
slide. Usually, the array produced is redundant, having several oligonucleotide probes on the
chip specific for the hTRT polynucleotide to be detected.

Combinations of oligonucleotide probes can be designed to detect 
alternatively spliced mRNAs, or to identify which of various hTRT alleles is expressed in a 
particular sample. 

In one illustrative embodiment, cDNA prepared by reverse transcription of 
total RNA from a test cell is amplified (e.g., using PCR). Typically the amplification product 
is labeled, e.g., by incorporation of a fluorescently labeled dNTP. The labeled cDNAs are 
then hybridized to a chip comprising oligonucleotide probes complementary to various 
subsequences of the hTRT gene. The positions of hybridization are determined (e.g., in 
accordance with the general methods of Shalon et al., 1996, Genome Research 6:639 or
Schena et al., 1996, Genome Res. 6:639), and sequence (or other information) deduced from 
the hybridization pattern, by means well known in the art. 

In one embodiment, two cDNA samples, each labeled with a different
fluorescent group, are hybridized to the same chip. The ratio of the hybridization of each
labeled sample to sites complementary to the hTRT gene is then assayed. If both samples
contain the same amount of hTRT mRNA, the ratio of the two fluors will be 1:1 (it will be
appreciated that the signal from the fluors may need to be adjusted to account for any
difference in the molar sensitivity of the fluors). In contrast, if one sample is from a healthy
(or control) tissue and the second sample is from a cancerous tissue, the fluor used in the
second sample will predominate.
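The adjustment for differing molar sensitivity of the two fluors can be sketched as a simple correction applied before the ratio is taken. In the Python illustration below, the function name, the representation of sensitivity as signal units per mole of fluor, and the numbers are all assumptions of the sketch.

```python
def corrected_ratio(test_signal: float,
                    control_signal: float,
                    test_sensitivity: float = 1.0,
                    control_sensitivity: float = 1.0) -> float:
    """Ratio of test to control hybridization after dividing each raw signal
    by the molar sensitivity (signal units per mole) of its fluor.
    """
    if test_sensitivity <= 0 or control_sensitivity <= 0:
        raise ValueError("sensitivities must be positive")
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return (test_signal / test_sensitivity) / (control_signal / control_sensitivity)
```

A corrected ratio near 1 indicates similar amounts of hTRT mRNA in the two samples, while a ratio well above 1 indicates that the fluor of the test sample predominates. For instance, raw signals of 200 and 100 units give a corrected ratio of 1 if the first fluor is twice as sensitive as the second.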

c) IN SITU HYBRIDIZATION

An alternative means for detecting expression of a gene encoding an hTRT
protein is in situ hybridization. In situ hybridization assays are well known and are generally
described in Angerer et al., METHODS ENZYMOL., 152: 649-660 (1987) and Ausubel et al.,
supra. In an in situ hybridization assay, cells or tissue specimens are fixed to a solid support,
typically in a permeabilized state, typically on a glass slide. The cells are then contacted
with a hybridization solution at a moderate temperature to permit annealing of labeled nucleic
acid probes (e.g., 35S-labeled riboprobes, fluorescently labeled probes) completely or
substantially complementary to hTRT. Free probe is removed by washing and/or nuclease
digestion, and bound probe is visualized directly on the slide by autoradiography or an
appropriate imaging technique, as is known in the art.

4) SPECIFIC DETECTION OF VARIANTS 

As noted supra and illustrated in the Examples (e.g., Example 9), 
amplification primers or probes can be selected to provide amplification products that span 
specific deletions, truncations, and insertions, thereby facilitating the detection of specific 
variants or abnormalities in the hTRT mRNA. 

One example of an hTRT variant gene product that may be detected is an 
hTRT RNA such as a product (SEQUENCE ID NO: 4) described supra and in Example 9. 
The biological function, if any, of the Δ182 variant(s) is not known; however, the truncated 
hTRT protein putatively encoded by the variant may be involved in regulation of telomerase 
activity, e.g., by assembling a non-functional telomerase RNP that titrates telomerase 
components. Alternatively, negative regulation of telomerase activity could be accomplished 
by directing hTRT pre-mRNA (nascent mRNA) processing in a manner that eliminates the 
full-length mRNA, thereby reducing hTRT mRNA levels and increasing Δ182 
hTRT RNA levels. For these and other reasons, the ability to detect Δ182 variants is useful. 
In addition, it will sometimes be desirable, in samples in which two species of hTRT RNA 
are present (such as a Δ182 hTRT RNA and hTRT RNA encoding the full-length hTRT 
protein), to compare their relative and/or absolute abundance. 

The invention provides a variety of methods for detection of Δ182 variants. 
For example, amplification using primer pairs spanning the 182 basepair deletion will result 
in different sized products corresponding to the deleted and undeleted hTRT RNAs, if both 
are present, which can be distinguished on the basis of size (e.g., by gel electrophoresis). 
Examples of primer pairs useful for amplifying the region spanning the 182 bp deletion 
include TCP1.14 and TCP1.15 (primer set 1), or TCP1.25 and bTCP6 (primer set 2) (see 
Table 2). These primer pairs can be used individually or in a nested PCR experiment where 
primer set 1 is used first. It will also be apparent to one of skill that hybridization methods 
(e.g., Northern hybridization) or RNase protection assays using an hTRT nucleic acid probe 
of the invention can be used to detect and distinguish hTRT RNA variants. 
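
The size difference expected from primers flanking the deletion can be worked through as follows. This is an illustrative sketch only: the deleted interval (nucleotides 2345 to 2526 of SEQUENCE ID NO: 1) and the TCP1.25 start position (2270) are taken from the text, but the reverse primer end coordinate (2600) is a hypothetical value chosen for the example, since the position of bTCP6 is not stated here.

```python
def expected_product_sizes(forward_start, reverse_end, deletion_start, deletion_end):
    """Return (undeleted_size, deleted_size) in base pairs for a primer pair
    flanking a deletion. Coordinates are 1-based and inclusive on the
    undeleted (full-length) sequence."""
    if not (forward_start < deletion_start and reverse_end > deletion_end):
        raise ValueError("primers must flank the deleted region")
    undeleted = reverse_end - forward_start + 1
    deleted = undeleted - (deletion_end - deletion_start + 1)
    return undeleted, deleted


# 2270: 5' end of TCP1.25 (from the text); 2600: HYPOTHETICAL reverse primer end.
undeleted_bp, deleted_bp = expected_product_sizes(2270, 2600, 2345, 2526)
print(undeleted_bp, deleted_bp)
```

Under these assumptions the two products differ by exactly 182 bp, which is the difference that would be resolved by gel electrophoresis.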

Another suitable method entails PCR amplification (or the equivalent) using 
three primers. Analogous to the semi-competitive quantitative PCR method described in 
greater detail supra, one primer is specific to each of the hTRT RNA species (e.g., as 
illustrated in Table 4) and one primer is complementary to both species (e.g., TCP1.25 
(2270-2288)). An example of a primer specific to SEQUENCE ID NO: 1 is one that anneals within 
the 182 nucleotide sequence (i.e., nucleotides 2345 to 2526 of SEQUENCE ID NO: 1), e.g., 
TCP1.73 (2465-2445). An example of a primer specific to SEQUENCE ID NO: 4 (a Δ182 
variant) is one that anneals at nucleotides 2358 to 2339 of SEQUENCE ID NO: 4 (i.e., the 
site corresponding to the 182 nucleotide insertion in SEQUENCE ID NO: 1). The absolute 
abundance of the Δ182 hTRT mRNA species or its relative abundance compared to the 
species encoding the full-length hTRT protein can be analyzed for correlation to cell state 
(e.g., capacity for indefinite proliferation). It will be appreciated that numerous other primers 
or amplification or detection methods can be selected based on the present disclosure. 
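
As a consistency check on the three-primer scheme, the primer lengths can be compared with the stated coordinates and the expected product sizes computed. The sketch below uses the sequences listed in Table 4 and the coordinates given in the text; it assumes that SEQUENCE ID NO: 1 and SEQUENCE ID NO: 4 share the same numbering upstream of the deletion (i.e., before nucleotide 2345), which is an assumption made for illustration. The dictionary keys and function names are hypothetical.

```python
# Each entry: (sequence 5'->3', coordinate of 5' end, coordinate of 3' end).
# Reverse primers therefore have a 5' coordinate larger than the 3' coordinate.
PRIMERS = {
    "TCP1.25": ("TACTGCGTGCGTCGGTATG", 2270, 2288),             # common forward
    "TCP1.73": ("CACTGCTGGCCTCATTCAGGG", 2465, 2445),           # SEQ ID NO: 1 specific
    "delta182_specific": ("GGCACTGGACGTAGGACGTG", 2358, 2339),  # SEQ ID NO: 4 specific
}


def span_length(five_prime, three_prime):
    """Number of nucleotides covered by an inclusive coordinate span."""
    return abs(five_prime - three_prime) + 1


def product_size(forward_name, reverse_name):
    """Product length from the 5' end of the forward primer to the 5' end
    of the reverse primer, inclusive."""
    forward_5prime = PRIMERS[forward_name][1]
    reverse_5prime = PRIMERS[reverse_name][1]
    return reverse_5prime - forward_5prime + 1


length_check = {name: len(seq) == span_length(a, b)
                for name, (seq, a, b) in PRIMERS.items()}
full_length_product = product_size("TCP1.25", "TCP1.73")
delta182_product = product_size("TCP1.25", "delta182_specific")

print(length_check)
print(full_length_product, delta182_product)
```

Under the stated numbering assumption, the two species give products of clearly different size from the same forward primer, so they could be resolved and compared in a single reaction.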

TABLE 4 
ILLUSTRATIVE PRIMERS 

Δ182 species (e.g., SEQUENCE ID NO. 4) specific primer: 
5'-GGCACTGGACGTAGGACGTG-3' 

hTRT (SEQUENCE ID NO. 1) specific primer (TCP1.73): 
5'-CACTGCTGGCCTCATTCAGGG-3' 

Common (forward) primer (TCP1.25): 
5'-TACTGCGTGCGTCGGTATG-3' 

Other variant hTRT genes or gene products that can be detected include those 
characterized by premature stop codons, deletions, substitutions or insertions. Deletions can 
be detected by the decreased size of the gene, mRNA transcript, or cDNA. Similarly, 
insertions can be detected by the increased size of the gene, mRNA transcript, or cDNA. 
Insertions and deletions could also cause shifts in the reading frame that lead to premature 
stop codons or longer open reading frames. Substitutions, deletions, and insertions can also 
be detected by probe hybridization. Alterations can also be detected by observing changes in 
the size of the variant hTRT polypeptide (e.g., by Western analysis) or by hybridization or 
specific amplification as appropriate. Alternatively, mutations can be determined by 
sequencing of the gene or gene product according to standard methods. In addition, and as 
noted above, amplification assays and hybridization probes can be selected to target particular 
abnormalities specifically. For example, nucleic acid probes or amplification primers can be 
selected that specifically hybridize to or amplify, respectively, the region encompassing the 
deletion, substitution, or insertion. Where the hTRT gene harbors such a mutation, the probe 
will either (1) fail to hybridize or the amplification reaction will fail to provide specific 
amplification or cause a change in the size of the amplification product or hybridization 
signal; or (2) the probe or amplification reaction encompasses the entire deletion or either end 
of the deletion (deletion junction); or (3) similarly, probes and amplification primers can be 
selected that specifically target point mutations or insertions. 

5) DETECTION OF MUTANT hTRT ALLELES 

Mutations in the hTRT gene can be responsible for disease initiation or can 
contribute to a disease condition. Alterations of the genomic DNA of hTRT can affect levels 
of gene transcription, change amino acid residues in the hTRT protein, cause truncated hTRT 
polypeptides to be produced, alter pre-mRNA processing pathways (which can alter hTRT 
mRNA levels), and cause other consequences as well. 

Alterations of genomic DNA in non-hTRT loci can also affect expression of 
hTRT or telomerase by altering the enzymes or cellular processes that are responsible for 
regulating hTRT, hTR, and telomerase-associated protein expression and processing and RNP 
assembly and transport. Alterations which affect hTRT expression, processing, or RNP 
assembly could be important for cancer progression, for diseases of aging, for DNA damage 
diseases, and others. 

Detection of mutations in hTRT mRNA or its gene and gene control elements 
can be accomplished in accordance with the methods herein in multiple ways. Illustrative 
examples include the following: A technique termed primer screening can be employed; 
PCR primers are designed whose 3' termini anneal to nucleotides in a sample DNA (or RNA) 
that are possibly mutated. If the DNA (or RNA) is amplified by the primers, then the 3' 
termini matched the nucleotides in the gene; if the DNA is not amplified, then one or both 
termini did not match the nucleotides in the gene, indicating a mutation was present. Similar 
primer design can be used to assay for point mutations using the Ligase Chain Reaction 
(LCR, described supra). Restriction fragment length polymorphism, RFLP (Pourzand, C., 
Cerutti, P. (1993) Mutat. Res. 288:113-121), is another technique that can be applied in the 
present method. A Southern blot of human genomic DNA digested with various restriction 
enzymes is probed with an hTRT specific probe. Differences in the fragment number or sizes 
between the sample and a control indicate an alteration of the experimental sample, usually 
an insertion or deletion. Single strand conformation polymorphism, SSCP (Orita, M., et al. 
(1989) PNAS USA 86:2766-70), is another technique that can be applied in the present 
method. SSCP is based on the differential migration of denatured wild-type and mutant 
single-stranded DNA (usually generated by PCR). Single-stranded DNA will take on a 
three-dimensional conformation that is sequence-specific. Sequence differences as small as a 
single base change can result in a mobility shift on a nondenaturing gel. SSCP is one of the 
most widely used mutation screening methods because of its simplicity. Denaturing Gradient 
Gel Electrophoresis, DGGE (Myers, R. M., Maniatis, T. and Lerman, L., (1987) Methods in 
Enzymology, 155: 501-527), is another technique that can be applied in the present method. 
DGGE identifies mutations based on the melting behavior of double-stranded DNA. 
Specialized denaturing electrophoresis equipment is utilized to observe the melting profile of 
experimental and control DNAs: a DNA containing a mutation will have a different mobility 
compared to the control in these gel systems. The examples discussed illustrate commonly 
employed methodology; many other techniques exist which are known by those skilled in the 
art and can be applied in accordance with the teachings herein. 
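
The logic of the RFLP comparison described above can be illustrated with an in silico digest. This is a simplified sketch: the sequences are hypothetical and are not hTRT sequences, the enzyme is assumed to be EcoRI (recognition site GAATTC, cut after the first G), and all fragments are reported, whereas a Southern blot would show only those fragments that hybridize to the probe.

```python
def digest(sequence, site, cut_offset):
    """Return fragment lengths after cutting a linear sequence at every
    occurrence of `site`. `cut_offset` is the number of bases of the site
    that remain on the upstream fragment."""
    fragments = []
    start = 0
    pos = sequence.find(site)
    while pos != -1:
        cut = pos + cut_offset
        fragments.append(cut - start)
        start = cut
        pos = sequence.find(site, pos + 1)
    fragments.append(len(sequence) - start)
    return fragments


ECORI_SITE = "GAATTC"  # EcoRI cuts G^AATTC, so cut_offset is 1

# HYPOTHETICAL sequences: the sample carries a point change (A to G) that
# destroys the second EcoRI site present in the control.
control = "A" * 10 + "GAATTC" + "C" * 20 + "GAATTC" + "T" * 5
sample = "A" * 10 + "GAATTC" + "C" * 20 + "GAGTTC" + "T" * 5

control_fragments = digest(control, ECORI_SITE, 1)
sample_fragments = digest(sample, ECORI_SITE, 1)

print(control_fragments)
print(sample_fragments)
```

In this toy case, loss of a restriction site changes both the number and the sizes of fragments relative to the control; an insertion or deletion between two intact sites would instead change a fragment size without changing the fragment count.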

F. KARYOTYPE ANALYSIS 

The present invention further provides methods and reagents for karyotype or 
other chromosomal analysis using hTRT-sequence probes and/or detecting or locating hTRT 
gene sequences in chromosomes from a human patient, human cell line, or non-human cell. 
In one embodiment, amplification (i.e., change in copy number), deletion (i.e., partial 
deletion), insertion, substitution, or changes in the chromosomal location (e.g., translocation) 
of an hTRT gene may be correlated with the presence of a pathological condition or a 
predisposition to developing a pathological condition (e.g., cancer). 

It has been determined by the present inventors that, in normal human cells, 
the hTRT gene maps close to the telomere of chromosome 5p (see Example 5, infra). The 
closest STS marker is D5S678 (see Figure 8). The location can be used to identify markers 
that are closely linked to the hTRT gene. The markers can be used to identify YACs, STSs, 
cosmids, BACs, lambda or P1 phage, or other clones which contain hTRT genomic 
sequences or control elements. The markers or the gene location can be used to scan human 
tissue samples for alterations in the normal hTRT gene location, organization or sequence that 
are associated with the occurrence of a type of cancer or disease. This information can be used 
in a diagnostic or prognostic manner for the disease or cancer involved. Moreover, the nature 
of any alterations to the hTRT gene can be informative as to the nature by which cells 
become immortal. For instance, a translocation event could indicate that activation of hTRT 
expression occurs in some cases by replacing the hTRT promoter with another promoter 
which directs hTRT transcription in an inappropriate manner. Methods and reagents of the 
invention of this type can be used to inhibit hTRT activation. The location may also be 
useful for determining the nature of hTRT gene repression in normal somatic cells, for 
instance, whether the location is part of non-expressing heterochromatin. Nuclease 
hypersensitivity assays for distinguishing heterochromatin and euchromatin are described, for 
example, in Wu et al., 1979, Cell 16:797; Groudine and Weintraub, 1982, Cell 30:131; Gross 
and Garrard, 1988, Ann. Rev. Biochem. 57:159. 

In one embodiment, alterations to the hTRT gene are identified by karyotype 
analysis, using any of a variety of methods known in the art. One useful technique is in 
situ hybridization (ISH). Typically, when in situ hybridization techniques are used for 
karyotype analysis, a detectable or detectably-labeled probe is hybridized to a chromosomal 
sample in situ to locate an hTRT gene sequence. Generally, ISH comprises one or more of 
the following steps: (1) fixation of the tissue, cell or other biological structure to be analyzed; 
(2) prehybridization treatment of the biological structure to increase accessibility of target 
DNA (e.g., denaturation with heat or alkali), and to reduce nonspecific binding (e.g., by 
blocking the hybridization capacity of repetitive sequences, e.g., using human genomic 
DNA); (3) hybridization of one or more nucleic acid probes (e.g., conventional nucleic acids, 
PNAs, or probes containing other nucleic acid analogs) to the nucleic acid in the biological 
structure or tissue; (4) posthybridization washes to remove nucleic acid fragments not bound 
in the hybridization; and (5) detection of the hybridized nucleic acid fragments. The reagents 
used in each of these steps and conditions for their use vary depending on the particular 
application. It will be appreciated that these steps can be modified in a variety of ways well 
known to those of skill in the art. 

In one embodiment of ISH, the hTRT probe is labeled with a fluorescent label 
(fluorescent in situ hybridization; "FISH"). Typically, it is desirable to use dual color 
fluorescent in situ hybridization, in which two probes are utilized, each labeled by a different 
fluorescent dye. A test probe that hybridizes to the hTRT sequence of interest is labeled with 
one dye, and a control probe that hybridizes to a different region is labeled with a second dye. 
A nucleic acid that hybridizes to a stable portion of the chromosome of interest, such as the 
centromere region, can be used as the control probe. In this way, one can account for 
differences between efficiency of hybridization from sample to sample. 
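
One way the control probe can be used to account for sample-to-sample differences is to express the test (hTRT) signal count per nucleus relative to the control (e.g., centromere) signal count. The following is a minimal sketch under that assumption; the scoring scheme, function name, and signal counts are hypothetical and are not taken from the disclosure.

```python
from statistics import mean


def normalized_copy_ratio(cells):
    """Mean ratio of test-probe signals to control-probe signals per nucleus.

    `cells` is a list of (test_signal_count, control_signal_count) tuples.
    Nuclei with no control signal are skipped, on the assumption that
    hybridization failed in those nuclei."""
    ratios = [test / control for test, control in cells if control > 0]
    if not ratios:
        raise ValueError("no nuclei with a detectable control signal")
    return mean(ratios)


# HYPOTHETICAL signal counts per nucleus: (hTRT probe, centromere probe).
normal_cells = [(2, 2), (2, 2), (1, 1), (0, 0)]
tumor_cells = [(6, 2), (8, 2), (4, 2)]

normal_ratio = normalized_copy_ratio(normal_cells)
tumor_ratio = normalized_copy_ratio(tumor_cells)

print(normal_ratio, tumor_ratio)
```

A ratio near 1 would be consistent with an unaltered copy number relative to the control region, while a ratio well above 1 would be consistent with amplification of the sequence detected by the test probe.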

The ISH methods for detecting chromosomal abnormalities (e.g., FISH) can be 
performed on nanogram quantities of the subject nucleic acids. Paraffin embedded normal 
tissue or tumor sections can be used, as can fresh or frozen material, tissues, or sections. 
Because FISH can be applied to limited material, touch preparations prepared from 
uncultured primary tumors can also be used (see, e.g., Kallioniemi et al., 1992, Cytogenet. 
Cell Genet. 60:190). For instance, small biopsy tissue samples from tumors can be used for 
touch preparations (see, e.g., Kallioniemi et al., supra). Small numbers of cells obtained from 
aspiration biopsy or cells in bodily fluids (e.g., blood, urine, sputum and the like) can also be 
analyzed. For prenatal diagnosis, appropriate samples will include amniotic fluid, maternal 
blood, and the like. Useful hybridization protocols applicable to the methods and reagents 
disclosed here are described in Pinkel et al., 1988, Proc. Natl. Acad. Sci. USA, 85:9138; EPO 
Pub. No. 430,402; Choo, ed., Methods in Molecular Biology Vol. 33: In Situ 
Hybridization Protocols, Humana Press, Totowa, New Jersey, (1994); and Kallioniemi et 
al., supra. 

Other techniques useful for karyotype analysis include, for example, 
quantitative Southern blotting, quantitative PCR, or comparative genomic 
hybridization (Kallioniemi et al., 1992, Science, 258:818), using the hTRT probes and 
primers of the invention, which may be used to identify amplification, deletion, insertion, 
substitution or other rearrangement of hTRT sequences in chromosomes in a biological 
sample. 

G. TRT POLYPEPTIDE ASSAYS 

1) GENERALLY 

The present invention provides methods and reagents for detecting and 
quantitating hTRT polypeptides. These methods include analytical biochemical methods 
such as electrophoresis, mass spectroscopy, gel shift, capillary electrophoresis, 
chromatographic methods such as size exclusion chromatography, high performance liquid 
chromatography (HPLC), thin layer chromatography (TLC), hyperdiffusion chromatography, 
and the like, or various immunological methods such as fluid or gel precipitin reactions, 
immunodiffusion (single or double), immunoelectrophoresis, radioimmunoassay (RIA), 
enzyme-linked immunosorbent assays (ELISAs), immunofluorescent assays, western 
blotting, mass spectrometry, and others described below and apparent to those of skill in the 
art upon review of this disclosure. 

2) ELECTROPHORETIC ASSAYS 

In one embodiment, the hTRT polypeptides are detected in an electrophoretic 
protein separation; in one aspect, a two-dimensional electrophoresis system is employed. 
Means of detecting proteins using electrophoretic techniques are well known to those of skill 
in the art (see generally, R. Scopes (1982) Protein Purification, Springer-Verlag, N.Y.; 
Deutscher, (1990) Methods in Enzymology Vol. 182: Guide to Protein Purification, 
Academic Press, Inc., N.Y.). 

In a related embodiment, a mobility shift assay (see, e.g., Ausubel et al., 
supra) is used. For example, labeled hTR will associate with hTRT and migrate with altered 
mobility upon electrophoresis in a nondenaturing polyacrylamide gel or the like. Thus, for 
example, if an (optionally labeled) hTR probe or an (optionally labeled) telomerase primer is 
mixed with a sample containing hTRT, or coexpressed with hTRT (e.g., in a cell-free 
expression system), the presence of hTRT protein (or a polynucleotide encoding hTRT) in the 
sample will result in a detectable alteration of hTR mobility. 

3) IMMUNOASSAYS 

a) GENERALLY 

The present invention also provides methods for detection of hTRT 
polypeptides employing one or more antibody reagents of the invention (i.e., immunoassays). 
As used herein, an immunoassay is an assay that utilizes an antibody (as broadly defined 
herein, specifically including fragments, chimeras and other binding agents) that 
specifically binds an hTRT polypeptide or epitope. Antibodies of the invention may be made 
by a variety of means well known to those of skill in the art, e.g., as described supra. 

A number of well established immunological binding assay formats suitable 
for the practice of the invention are known (see, e.g., U.S. Patents 4,366,241; 4,376,110; 
4,517,288; and 4,837,168). See, e.g., Methods in Cell Biology Volume 37: Antibodies 
in Cell Biology, Asai, ed. Academic Press, Inc. New York (1993); Basic and Clinical 
Immunology 7th Edition, Stites & Terr, eds. (1991); Harlow and Lane, supra [e.g., Chapter 
14], and Ausubel et al., supra, [e.g., Chapter 11], each of which is incorporated by reference 
in its entirety and for all purposes. Typically, immunological binding assays (or 
immunoassays) utilize a "capture agent" to specifically bind to and, often, immobilize the 
analyte. In one embodiment, the capture agent is a moiety that specifically binds to an hTRT 
polypeptide or subsequence, such as an anti-hTRT antibody. In an alternative embodiment, 
the capture agent may bind an hTRT-associated protein or RNA under conditions in which 
the hTRT-associated molecule remains bound to the hTRT (such that if the hTRT-associated 
molecule is immobilized the hTRT protein is similarly immobilized). It will be understood 
that in assays in which an hTRT-associated molecule is captured the associated hTRT protein 
will usually be present and so can be detected, e.g., using an anti-hTRT antibody or the like. 
Immunoassays for detecting protein complexes are known in the art (see, e.g., Harlow and 
Lane, supra, at page 583). 

Usually the hTRT gene product being assayed is detected directly or indirectly 
using a detectable label. The particular label or detectable group used in the assay is usually 
not a critical aspect of the invention, so long as it does not significantly interfere with the 
specific binding of the antibody or antibodies used in the assay. The label may be covalently 
attached to the capture agent (e.g., an anti-TRT antibody), or may be attached to a third 
moiety, such as another antibody, that specifically binds to, e.g.: the hTRT polypeptide (at a 
different epitope than recognized by the capture agent); the capture agent (e.g., an anti-(first 
antibody) immunoglobulin); an anti-TRT antibody; an antibody that binds an anti-TRT 
antibody; or an antibody/telomerase complex (e.g., via binding to an associated molecule 
such as a telomerase-associated protein). Other proteins capable of binding an antibody used 
in the assay, such as protein A or protein G, may also be labeled. In some embodiments, it 
will be useful to use more than one labeled molecule (i.e., ones that can be distinguished from 
one another). In addition, when the target bound (e.g., immobilized) by the capture agent 
(e.g., anti-hTRT antibody) is a complex (i.e., a complex of hTRT and a TRT-associated 
protein, hTR, or other TRT associated molecule), a labeled antibody that recognizes the 
protein or RNA associated with the hTRT protein can be used. When the complex is a 
protein-nucleic acid complex (e.g., TRT-hTR), the reporter molecule can be a polynucleotide 
or other molecule (e.g., enzyme) that recognizes the RNA component of the complex. 

Some immunoassay formats do not require the use of labeled components. 
For instance, agglutination assays can be used to detect the presence of the target antibodies. 
In this case, antigen-coated particles are agglutinated by samples comprising the target 
antibodies. In this format, the components do not need to be labeled, and the presence of the 
target antibody can be detected by simple visual inspection. 

b) NON-COMPETITIVE ASSAY FORMATS 

The present invention provides methods and reagents for competitive and 
noncompetitive immunoassays for detecting hTRT polypeptides. Noncompetitive 
immunoassays are assays in which the amount of captured analyte (in this case hTRT) is 
directly measured. One such assay is a two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on the hTRT protein. See, 
e.g., Maddox et al., 1983, J. Exp. Med., 158:1211 for background information. In one 
preferred "sandwich" assay, the capture agent (e.g., an anti-TRT antibody) is bound directly 
to a solid substrate where it is immobilized. These immobilized antibodies then capture any 
hTRT protein present in the test sample. The hTRT thus immobilized can then be labeled, 
e.g., by binding to a second anti-hTRT antibody bearing a label. Alternatively, the second 
anti-hTRT antibody may lack a label, but be bound by a labeled third antibody specific to 
antibodies of the species from which the second antibody is derived. The second antibody 
alternatively can be modified with a detectable moiety, such as biotin, to which a third 
labeled molecule can specifically bind, such as enzyme-labeled streptavidin. 

c) COMPETITIVE ASSAY FORMATS 

In competitive assays, the amount of hTRT protein present in the sample is 
measured indirectly by measuring the amount of an added (exogenous) hTRT displaced (or 
competed away) from a capture agent (e.g., anti-TRT antibody) by the hTRT protein present 
in the sample. In one competitive assay, a known amount of labeled hTRT protein is added 
to the sample and the sample is then contacted with a capture agent (e.g., an antibody that 
specifically binds hTRT protein). The amount of exogenous (labeled) hTRT protein bound to 
the antibody is inversely proportional to the concentration of hTRT protein present in the 
sample. In one embodiment, the antibody is immobilized on a solid substrate. The amount 
of hTRT protein bound to the antibody may be determined either by measuring the amount of 
hTRT protein present in a TRT/antibody complex, or alternatively by measuring the amount 
of remaining uncomplexed TRT protein. The amount of hTRT protein may be detected by 
providing a labeled hTRT molecule. 
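
The inverse relationship between bound labeled hTRT and sample hTRT can be illustrated with an idealized calculation. The sketch below rests on simplifying assumptions that are not part of the disclosure: antibody binding sites are limiting and fully occupied, and labeled and unlabeled hTRT bind with equal affinity, so that the fraction of bound protein that is labeled equals the fraction of total protein that is labeled. In practice a standard curve prepared with known amounts of hTRT would ordinarily be used; the function name and values are hypothetical.

```python
def estimate_sample_amount(tracer_amount, signal_without_sample, signal_with_sample):
    """Estimate unlabeled (sample) hTRT from the drop in bound-label signal.

    Idealized model: bound signal B = B0 * T / (T + S), where T is the amount
    of labeled hTRT added, S is the sample hTRT, and B0 is the signal when no
    sample hTRT is present. Solving for S gives S = T * (B0 / B - 1)."""
    if not (0 < signal_with_sample <= signal_without_sample):
        raise ValueError("signal with sample must be positive and not exceed B0")
    return tracer_amount * (signal_without_sample / signal_with_sample - 1.0)


# HYPOTHETICAL values: 10 units of labeled hTRT, B0 = 1000 signal units.
no_sample = estimate_sample_amount(10.0, 1000.0, 1000.0)
equal_amount = estimate_sample_amount(10.0, 1000.0, 500.0)
excess_sample = estimate_sample_amount(10.0, 1000.0, 250.0)

print(no_sample, equal_amount, excess_sample)
```

Under this model, halving of the bound signal corresponds to a sample containing as much hTRT as the labeled protein added, and a lower signal corresponds to a proportionally larger amount.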

A hapten inhibition assay is another example of a competitive assay. In this 
assay hTRT protein is immobilized on a solid substrate. A known amount of anti-TRT 
antibody is added to the sample, and the sample is then contacted with the immobilized hTRT 
protein. In this case, the amount of anti-TRT antibody bound to the immobilized hTRT 
protein is inversely proportional to the amount of hTRT protein present in the sample. The 
amount of immobilized antibody may be detected by detecting either the immobilized 
fraction of antibody or the fraction of the antibody that remains in solution. In this aspect, 
detection may be direct, where the antibody is labeled, or indirect, where the label is bound to 
a molecule that specifically binds to the antibody as described above. 

d) OTHER ASSAY FORMATS 

The invention also provides reagents and methods for detecting and 
quantifying the presence of hTRT in the sample by using an immunoblot (Western blot) 
format. In this format, hTRT polypeptides in a sample are separated from other sample 
components by gel electrophoresis (e.g., on the basis of molecular weight), the separated 
proteins are transferred to a suitable solid support (such as a nitrocellulose filter, a nylon 
filter, derivatized nylon filter, or the like), and the support is incubated with anti-TRT 
antibodies of the invention. The anti-TRT antibodies specifically bind to hTRT or other TRT 
on the solid support. These antibodies may be directly labeled or alternatively may be 
subsequently detected using labeled antibodies (e.g., labeled sheep anti-mouse antibodies) or 
other labeling reagents that specifically bind to the anti-TRT antibody. 

Other assay formats include liposome immunoassays (LIA), which use 
liposomes designed to bind specific molecules (e.g., antibodies) and release encapsulated 
reagents or markers. The released chemicals can then be detected according to standard 
techniques (see, Monroe et al., 1986, Amer. Clin. Prod. Rev. 5:34). 

As noted supra, assay formats using FACS (and equivalent instruments or 
methods) have advantages when measuring hTRT gene products in a heterogeneous sample 
(such as a biopsy sample containing both normal and malignant cells). 

e) SUBSTRATES, SOLID SUPPORTS, MEMBRANES, FILTERS 

As noted supra, depending upon the assay, various components, including the 
antigen, target antibody, or anti-hTRT antibody, may be bound to a solid surface or support 
(e.g., a substrate, membrane, or filter paper). Many methods for immobilizing biomolecules 
to a variety of solid surfaces are known in the art. For instance, the solid surface may be a 
membrane (e.g., nitrocellulose), a microtiter dish (e.g., PVC, polypropylene, or polystyrene), 
a test tube (glass or plastic), a dipstick (e.g., glass, PVC, polypropylene, polystyrene, latex, 
and the like), a microcentrifuge tube, or a glass or plastic bead. The desired component may 
be covalently bound or noncovalently attached through nonspecific bonding. 

A wide variety of organic and inorganic polymers, both natural and synthetic, 
may be employed as the material for the solid surface. Illustrative polymers include 
polyethylene, polypropylene, poly(4-methylbutene), polystyrene, polymethacrylate, 
poly(ethylene terephthalate), rayon, nylon, poly(vinyl butyrate), polyvinylidene difluoride 
(PVDF), silicones, polyformaldehyde, cellulose, cellulose acetate, nitrocellulose, and the like. 
Other materials which may be employed include paper, glasses, ceramics, metals, metalloids, 
semiconductive materials, cements or the like. In addition, substances that form gels, such as 
proteins (e.g., gelatins), lipopolysaccharides, silicates, agarose and polyacrylamides can be 
used. Polymers which form several aqueous phases, such as dextrans, polyalkylene glycols 
or surfactants, such as phospholipids, long chain (12-24 carbon atoms) alkyl ammonium salts 
and the like are also suitable. Where the solid surface is porous, various pore sizes may be 
employed depending upon the nature of the system. 

In preparing the surface, a plurality of different materials may be employed, 
particularly as laminates, to obtain various properties. For example, protein coatings, such as 
gelatin, can be used to avoid non-specific binding, simplify covalent conjugation, enhance 
signal detection or the like. 

If covalent bonding between a compound and the surface is desired, the 
surface will usually be polyfunctional or be capable of being polyfunctionalized. Functional 
groups which may be present on the surface and used for linking can include carboxylic 
acids, aldehydes, amino groups, cyano groups, ethylenic groups, hydroxyl groups, mercapto 
groups and the like. The manner of linking a wide variety of compounds to various surfaces 
is well known and is amply illustrated in the literature. See, for example, Immobilized 
Enzymes, Ichiro Chibata, Halsted Press, New York, 1978, and Cuatrecasas (1970) J. Biol. 
Chem. 245:3059. 

In addition to covalent bonding, various methods for noncovalently binding an 
assay component can be used. Noncovalent binding is typically nonspecific adsorption of a 
compound to the surface. 

One of skill in the art will appreciate that it is often desirable to reduce 
non-specific binding in immunoassays. Particularly, where the assay involves an antigen or 
antibody immobilized on a solid substrate, it is desirable to minimize the amount of 
non-specific binding to the substrate. Means of reducing such non-specific binding are well 
known to those of skill in the art. Typically, this involves coating the substrate with a 
proteinaceous composition. In particular, protein compositions such as bovine serum 
albumin (BSA), nonfat powdered milk, and gelatin are widely used, with powdered milk 
sometimes preferred. Alternatively, the surface is designed such that it nonspecifically binds 
one component but does not significantly bind another. For example, a surface bearing a 
lectin such as Concanavalin A will bind a carbohydrate containing compound but not a 
labeled protein that lacks glycosylation. Various solid surfaces for use in noncovalent 
attachment of assay components are reviewed in U.S. Patent Nos. 4,447,576 and 4,254,082. 

H) ASSAYS FOR ANTI-TRT ANTIBODIES 

The present invention also provides reagents and assays for detecting 
hTRT-specific immunoglobulins. In one embodiment, immobilized hTRT (e.g., recombinant 
hTRT bound to a microassay plate well) is incubated with serum from a patient under 
conditions in which anti-hTRT antibodies, if present, bind the immobilized hTRT. After 
washing to remove nonspecifically bound immunoglobulin, bound serum antibodies can be 
detected, if they are present, by adding detectably labeled anti-(human Ig) antibodies 
(alternative embodiments and variations are well known to those of skill in the art; see, e.g., 
Harlow, supra, at Ch. 14). These assays are useful for detecting anti-hTRT antibodies in any 
source including animal or human serum or a carrier such as saline. In one embodiment, the 
assays are used to detect or monitor an immune response to hTRT proteins in a patient, 
particularly an autoimmune (e.g., anti-telomerase) response. Anti-hTRT antibodies may be 
present in the serum or other tissues or fluids from a patient suffering from an autoimmune 
disease or other condition. 



I) ASSAY COMBINATIONS 

The diagnostic and prognostic assays described herein can be carried out in 
various combinations and can also be carried out in conjunction with other diagnostic or
prognostic tests. For example, when the present methods are used to detect the presence of
cancer cells in a patient sample, the presence of hTRT can be used to determine the stage of the
disease, whether a particular tumor is likely to invade adjoining tissue or metastasize to a
distant location, and whether a recurrence of the cancer is likely. Tests that may provide
additional information include microscopic analysis of biopsy samples, detection of antigens
(e.g., cell-surface markers) associated with tumorigenicity (e.g., using histocytochemistry,
FACS, or the like), imaging methods (e.g., upon administration to a patient of labeled
anti-tumor antibodies), telomerase activity assays, telomere length assays, hTR assays, or the like.
Such combination tests can provide useful information regarding the progression of a disease. 

It will also be recognized that combinations of assays can provide useful 
information. For example, and as noted above, assays for hTRT mRNA can be combined 
with assays for hTR (human telomerase RNA) or telomerase activity (i.e., TRAP) assays to 
provide information about telomerase assembly and function. 

J) KITS 

The present invention also provides kits useful for the screening, monitoring, 
diagnosis and prognosis of patients with a telomerase-related condition, or for determination 
of the level of expression of hTRT in cells or cell lines. The kits include one or more 
reagents for determining the presence or absence of an hTRT gene product (RNA or protein) 
or for quantifying expression of the hTRT gene. Preferred reagents include nucleic acid 
primers and probes that specifically bind to the hTRT gene, RNA, cDNA, or portions thereof, 
along with proteins, peptides, antibodies, and control primers, probes, oligonucleotides, 
proteins, peptides and antibodies. Other materials, including enzymes (e.g., reverse 
transcriptases, DNA polymerases, ligases), buffers, reagents (labels, dNTPs), may be 
included. 

Alternatively, or in combination with any of the other
components described herein, the kits may include an antibody that specifically binds to hTRT polypeptides or
subsequences thereof. The antibody can be monoclonal or polyclonal. The antibody can be 
conjugated to another moiety such as a label and/or it can be immobilized on a solid support 
(substrate). The kit(s) may also contain a second antibody for detection of hTRT 
polypeptide/antibody complexes or for detection of hybridized nucleic acid probes, as well as 
one or more hTRT peptides or proteins for use as control or other reagents. 

The antibody or hybridization probe may be free or immobilized on a solid 
support such as a test tube, a microtiter plate, a dipstick and the like. The kit may also 
contain instructional materials teaching the use of the antibody or hybridization probe in an 
assay for the detection of TRT. The kit may contain appropriate reagents for detection of 
labels, or for labeling positive and negative controls, washing solutions, dilution buffers and 
the like.

In one embodiment, the kit includes a primer pair for amplifying hTRT
mRNA. Such a kit may also include a probe for hTRT amplified DNA and/or a polymerase, 
buffer, dNTPs, and the like. In another, the kit comprises a probe, optionally a labeled probe. 
In another, the kit comprises an antibody. 

X. IDENTIFICATION OF MODULATORS OF TELOMERASE ACTIVITY 
A. GENERALLY 

The invention provides compounds and treatments that modulate the activity 
or expression of a telomerase or telomerase component (e.g., hTRT protein). The invention 
also provides assays and screening methods (including high-throughput screens) for 
identification of compounds and treatments that modulate telomerase activity or expression. 
These modulators of telomerase activity and expression (hereinafter referred to as 
"modulators") include telomerase agonists (which increase telomerase activity and/or 
expression) and telomerase antagonists (which decrease telomerase activity and/or 
expression). 

The modulators of the invention have a wide variety of uses. For example, it 
is contemplated that telomerase modulators will be effective therapeutic agents for treatment 
of human diseases. Screening for agonist activity and transcriptional or translational 
activators provides for compositions that increase telomerase activity in a cell (including a 
telomere dependent replicative capacity, or a "partial" telomerase activity). Such agonist 
compositions provide for methods of immortalizing otherwise normal untransformed cells, 
including cells which can express useful proteins. Such agonists can also provide for 
methods of controlling cellular senescence. Conversely, screening for antagonist activity 
provides for compositions that decrease telomere dependent replicative capacity, thereby 
mortalizing otherwise immortal cells, such as cancer cells. Screening for antagonist activity 
provides for compositions that decrease telomerase activity, thereby preventing unlimited cell 
division of cells exhibiting unregulated cell growth, such as cancer cells. Illustrative diseases 
and conditions that may be treated using modulators are listed herein, e.g., in Sections VII 
and IX, supra. In general, the modulators of the invention can be used whenever it is desired 
to increase or decrease a telomerase activity in a cell or organism. Thus, in addition to use in
treatment of disease, a modulator that increases hTRT expression levels can be used to
produce a cultured human cell line having properties as generally described in Section VIII, 
supra, and various other uses that will be apparent to one of skill. 

A compound or treatment modulates "expression" of telomerase or a 
telomerase component when administration of the compound or treatment changes the rate or 
level of transcription of the gene encoding a telomerase component (e.g., the gene encoding 
hTRT mRNA), affects stability or post-transcriptional processing of RNA encoding a 
telomerase component (e.g., transport, splicing, polyadenylation, or other modification), 
affects translation, stability, post-translational processing or modification of an encoded 
protein (e.g., hTRT), or otherwise changes the level of functional (e.g., catalytically active) 
telomerase RNP. A compound or treatment affects a telomerase "activity" when 
administration of the compound or treatment changes a telomerase activity such as any 
activity described in Section IV(B), supra (e.g., including processive or non-processive 
telomerase catalytic activity; telomerase processivity; conventional reverse transcriptase 
activity; nucleolytic activity; primer or substrate binding activity; dNTP binding activity; 
RNA binding activity; telomerase RNP assembly; and protein binding activity). It will be 
appreciated that there is not necessarily a sharp delineation between changes in "activity" and 
changes in "expression," and that these terms are used for ease of discussion and not for 
limitation. It will also be appreciated that the modulators of the invention should specifically 
affect telomerase activity or expression (e.g., without generally changing the expression of 
housekeeping proteins such as actin) rather than, for example, reducing expression of a 
telomerase component by nonspecific poisoning of a target cell. 

B. ASSAYS FOR IDENTIFICATION OF TELOMERASE MODULATORS 

The invention provides methods and reagents to screen for compositions or 
compounds capable of affecting expression of a telomerase or telomerase component, capable 
of modifying the DNA replicative capacity of telomerase, or otherwise modifying the ability 
of the telomerase enzyme and TRT protein to synthesize telomeric DNA ("full activity"). 
The invention also provides screens for modulators of any or all of hTRT's "partial
activities." Thus, the present invention provides assays that can be used to screen for agents 
that increase the activity of telomerase, for example, by causing hTRT protein or telomerase
to be expressed in a cell in which it normally is not expressed or by increasing telomerase
activity levels in telomerase positive cells. 

Telomerase or telomerase subunit proteins or their catalytic or immunogenic 
fragments or oligopeptides thereof, can be used for screening therapeutic compounds in any 
of a variety of drug screening techniques. The fragment employed in such a test may be free 
in solution, affixed to a solid support, borne on a cell surface, or located intracellularly. The 
formation of binding complexes, between telomerase or the subunit protein and the agent 
being tested, may be measured. 

In various embodiments, the invention includes methods for screening for 
antagonists that: bind to the enzyme's active site; inhibit the association of its RNA moiety, 
telomerase-associated proteins, nucleotides, or telomeric DNA to telomerase or hTRT 
protein; promote the disassociation of the enzyme complex; interfere with transcription of the 
telomerase RNA moiety (e.g., hTR); or inhibit any of the "partial activities" described 
herein. The invention provides methods for screening for compositions that inhibit the 
association of nucleic acid and/or telomerase-associated compositions with hTRT, such as the 
association of hTR with hTRT or the association of hTRT with the human homologs of p80 
or p95 or another associated protein, or association of hTRT with a telomere or a nucleotide; 
screening for compositions that promote the disassociation or promote the association (i.e., 
assembly) of the enzyme complex, such as an antibody directed to hTR or hTRT; screening 
for agents that affect the processivity of the enzyme; and screening for nucleic acids and other
compositions that bind to telomerase, such as a nucleic acid complementary to hTR. The 
invention further contemplates screening for compositions that increase or decrease the 
transcription of the hTRT gene and/or translation of the hTRT gene product. The invention 
also contemplates a method of screening for telomerase modulators in animals, in one 
embodiment, by reconstituting a telomerase activity, or an anti-telomerase activity, in an 
animal, such as a transgenic animal. The invention provides for in vivo assay systems that
include "knockout" models, in which one or several units of the endogenous telomerase, 
telomerase RNA moiety and/or telomerase-associated proteins have been deleted or inhibited. 
The endogenous telomerase activity, full or partial, can remain or be absent. In one 
embodiment, an exogenous telomerase activity, full or partial, is reconstituted. 

In one embodiment of the invention, a variety of partial activity telomerase
assays are provided to identify a variety of different classes of modulators of telomerase
activity. The "partial activity" assays of the invention allow identification of classes of 
telomerase activity modulators that might otherwise not be detected in a "full activity" 
telomerase assay. One partial activity assay involves the non-processive activity of TRT and 
telomerase. The processive nature of telomerase is described by Morin (1989) Cell 59:521- 
529; see also Prowse (1993) "Identification of a nonprocessive telomerase activity from 
mouse cells" Proc. Natl. Acad. Sci. USA 90:1493-1497. Another partial activity assay of the
invention exploits the "reverse-transcriptase-like" activity of telomerase. In these assays, one 
assays the reverse transcriptase activity of the hTRT protein. See Lingner (1997) "Reverse 
transcriptase motifs in the catalytic subunit of telomerase" Science 276:561-567. Another 
partial activity assay of the invention exploits the "nucleolytic activity" of hTRT and 
telomerase, involving the enzyme's removing of at least one nucleotide, typically guanosine, 
from the 3' strand of a primer. This nucleolytic activity has been observed in Tetrahymena
telomerase by Collins (1993) "Tetrahymena telomerase catalyzes nucleolytic cleavage and 
nonprocessive elongation" Genes Dev 7:1364-1376. Another partial activity assay of the 
invention involves analyzing hTRT's and telomerase's ability to bind nucleotides as part of
its enzymatic processive DNA polymerization activity. Another partial activity assay of the 
invention involves analyzing hTRT's or telomerase's ability to bind its RNA moiety, i.e., 
hTR for human cells, used as a template for telomere synthesis. Additional partial activity 
assays of the invention involve analyzing hTRT's and telomerase's ability to bind 
chromosomes in vivo, or to bind oligonucleotide primers in vitro or in reconstituted systems, 
or to bind proteins associated with chromosomal structure (see, for an example of such a 
protein, Harrington (1995) J Biol Chem 270: 8893-8901). Chromosomal structures which 
bind hTRT include, for example, telomeric repeat DNA, telomere proteins, histones, nuclear 
matrix protein, cell division/cell cycle control proteins and the like.

In one embodiment, an assay for identification of modulators comprises 
contacting one or more cells (i.e., "test cells") with a test compound, and determining whether 
the test compound affects expression or activity of a telomerase (or telomerase component) in 
the cell. Usually this determination comprises comparing the activity or expression in the test
cell with that in a similar cell or cells (i.e., control cells) that have not been contacted with
the test compound. Alternatively, cell extracts may be used in place of intact cells. In a
related embodiment, the test compound is administered to a multicellular organism (e.g., a
plant or animal). The telomerase or telomerase component may be wholly endogenous to the 
cell or multicellular organism (i.e., encoded by naturally occurring endogenous genes), or 
may be a recombinant cell or transgenic organism comprising one or more recombinantly 
expressed telomerase components (e.g., hTRT, hTR, telomerase-associated proteins), or may 
have both endogenous and recombinant components. Thus, in one embodiment,
telomerase-activity-modulators are administered to mortal cells. In another embodiment,
telomerase-activity-modulators are administered to immortal cells. For example, antagonists of
telomerase-mediated DNA replication can be identified by administering the putative 
inhibitory composition to a cell that is known to exhibit significant amounts of telomerase 
activity, such as cancer cells, and measuring whether a decrease in telomerase activity, 
telomere length, or proliferative capacity is observed, all of which are indicative of a 
compound with antagonist activity. 

In another embodiment, a modulator is identified by monitoring a change in a 
telomerase activity of a ribonucleoprotein complex (RNP) comprising a TRT (e.g., hTRT) 
and a template RNA (e.g., hTR), which RNP is reconstituted in vitro (e.g., as described in 
Example 7, infra). 

In yet another embodiment, the modulator is identified by monitoring a change 
in expression of a TRT gene product (e.g., RNA or protein) in a cell, animal, in vitro 
expression system, or other expression system. 

In still another embodiment, the modulator is identified by changing the 
expression of a reporter gene, such as that described in Example 15, whose expression is 
regulated, in whole or part, by a naturally occurring TRT regulatory element such as a 
promoter or enhancer. In a related embodiment, the ability of a test compound to bind to a 
telomerase component (e.g., hTRT), RNA, or gene regulatory sequence (e.g., the TRT gene 
promoter) is assayed. 

In another embodiment, the modulator is identified by observing changes in 
hTRT pre-mRNA processing, for example, alternatively spliced products, alternative 
poly-adenylation events, RNA cleavage, and the like. In a related embodiment the activity of 
the modulator can be observed by monitoring the production of variant hTRT polypeptides, 
some of which may possess dominant-negative telomerase regulation activity.

Assay formats for identification of compounds that affect expression and
activity of proteins are well known in the biotechnological and pharmaceutical industries, and 
numerous additional assays and variations of the illustrative assays provided supra will be 
apparent to those of skill. 

Changes in telomerase activity or expression can be measured by any suitable 
method. Changes in levels of expression of a telomerase component (e.g., hTRT protein) or 
precursor (e.g., hTRT mRNA) can be assayed using methods well known to those of skill, 
some of which are described hereinabove, e.g., in Section IX and including monitoring levels 
of TRT gene products (e.g., protein and RNAs) by hybridization (e.g., using the TRT probes 
and primers of the invention), immunoassays (e.g., using the anti-TRT antibodies of the 
invention), RNAse protection assays, amplification assays, or any other suitable detection 
means described herein or known in the art. Quantitating amounts of nucleic acid in a sample 
(e.g., evaluating levels of RNA, e.g., hTR or hTRT mRNA) is also useful in evaluating cis- or 
trans- transcriptional regulators. 

Similarly, changes in telomerase activity can be measured using methods such 
as those described herein (e.g., in Section IV(B), supra) or other assays of telomerase 
function. Quantitation of telomerase activity, when desired, may be carried out by any 
method, including those disclosed herein. Telomerase antagonists that can cause or 
accelerate loss of telomeric structure can be identified by monitoring and measuring their 
effect on telomerase activity in vivo, ex vivo, or in vitro, or by their effects on telomere length 
(as measured or detected through staining, use of tagged hybridization probes or other means) 
or, simply, by the inhibition of cell division of telomerase positive cancer cells (critical 
shortening of telomeres leads to a phenomenon termed "crisis" or M2 senescence (Shay
(1991) Biochim. Biophys. Acta 1072:1-7), which cancer cells have bypassed by the activation
of telomerase, but which, in the absence of telomerase, will lead to their senescence or death 
through chromosomal deletion and rearrangement). The in vivo human telomerase activity 
reconstitution provides for a method of screening for telomerase modulators in cells or 
animals from any origin. Such agonists can be identified in an activity assay of the invention, 
including measurements of changes in telomere length. Other examples of assays measuring 
telomerase activity in cells include assays for the accumulation or loss of telomere structure, 
the TRAP assay or a quantitative polymerase chain reaction assay.

In one embodiment, the assays of the invention also include a method where
the test compound produces a statistically significant decrease in the activity of hTRT as
measured by the incorporation of a labeled nucleotide into a substrate compared to the
relative amount of incorporated label in a parallel reaction lacking the test compound, thereby
determining that the test compound is a telomerase inhibitor.
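
By way of illustration only, the comparison of labeled-nucleotide incorporation with and without a test compound can be sketched in Python. The specification does not prescribe a particular statistical test; the exact one-sided permutation test, the function names, the 0.05 significance threshold, and the replicate counts below are all assumptions made for this sketch.

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(control, treated):
    """Exact one-sided permutation test (an assumed choice of test).

    Returns the fraction of all relabelings of the pooled measurements in
    which the control-minus-treated difference in means is at least as
    large as the difference actually observed.
    """
    observed = mean(control) - mean(treated)
    pooled = list(control) + list(treated)
    indices = range(len(pooled))
    extreme = 0
    total = 0
    for control_idx in combinations(indices, len(control)):
        chosen = set(control_idx)
        group_a = [pooled[i] for i in control_idx]
        group_b = [pooled[i] for i in indices if i not in chosen]
        if mean(group_a) - mean(group_b) >= observed:
            extreme += 1
        total += 1
    return extreme / total


def is_inhibitor(control, treated, alpha=0.05):
    """Call the compound an inhibitor if the decrease is significant."""
    return permutation_p_value(control, treated) < alpha


# Hypothetical replicate counts of incorporated label (e.g., cpm).
control_cpm = [1520, 1480, 1610, 1555]   # parallel reactions, no compound
treated_cpm = [640, 702, 588, 671]       # reactions with test compound
print(is_inhibitor(control_cpm, treated_cpm))
```

With four replicates per arm there are only 70 possible relabelings, so the smallest attainable p-value is 1/70; more replicates would be needed if a stricter threshold were desired.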

The methods of the invention are amenable to adaptations from protocols 
described in the scientific and patent literature and known in the art. For example, when a 
telomerase or TRT protein of this invention is used to identify compositions which act as 
modulators of telomerase activities, large numbers of potentially useful molecules can be
screened in a single test. The modulators can have an inhibitory (antagonist) or potentiating
(agonist) effect on telomerase activity. For example, if a panel of 1,000 inhibitors is to be 
screened, all 1,000 inhibitors can potentially be placed into one microtiter well and tested 
simultaneously. If such an inhibitor is discovered, then the pool of 1,000 can be subdivided 
into 10 pools of 100 and the process repeated until an individual inhibitor is identified. 
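
The pooling strategy just described can be sketched, for illustration only, as the following Python routine. The callable `pool_inhibits` is a hypothetical stand-in for the laboratory assay (it reports whether a given pool shows inhibition), and the sketch assumes that a single active compound accounts for the activity of the pool; neither assumption is taken from the specification.

```python
def split_pool(pool, n_subpools=10):
    """Divide a pool into at most n_subpools contiguous sub-pools."""
    size = -(-len(pool) // n_subpools)  # ceiling division
    return [pool[i:i + size] for i in range(0, len(pool), size)]


def deconvolute(pool, pool_inhibits, n_subpools=10):
    """Narrow an active pool down to one compound.

    Returns (compound_or_None, number_of_assays_performed).
    """
    assays = 1  # the initial test of the whole pool
    if not pool_inhibits(pool):
        return None, assays
    while len(pool) > 1:
        for subpool in split_pool(pool, n_subpools):
            assays += 1
            if pool_inhibits(subpool):
                pool = subpool  # continue with the active sub-pool
                break
        else:
            # No sub-pool was active (e.g., activity required a mixture).
            return None, assays
    return pool[0], assays


# Hypothetical library of 1,000 compounds with one active member.
library = [f"cmpd-{i:04d}" for i in range(1000)]
active = "cmpd-0427"
hit, n_assays = deconvolute(library, lambda p: active in p)
print(hit, n_assays)
```

Under these assumptions, a pool of 1,000 is resolved in at most 31 assays (1 + 10 + 10 + 10) rather than 1,000 individual tests.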

In drug screening, large numbers of compounds are examined for their ability
to act as telomerase modulators, a process greatly accelerated by the techniques of high
throughput screening. The assays for telomerase activity, full or partial, described herein may 
be adapted to be used in a high throughput technique. Those skilled in the art appreciate that 
there are numerous methods for accomplishing this purpose. 

Another technique for drug screening which may be applied for high
throughput screening of compounds having suitable binding affinity to the telomerase or
telomerase protein subunit is described in detail in "Determination of Amino Acid Sequence
Antigenicity" by Geysen, (Geysen, WO Application 84/03564, published on September 13,
1984, incorporated herein by reference). In summary, large numbers of different small
peptide test compounds are synthesized on a solid substrate, such as plastic pins or some
other surface. The peptide test compounds are reacted with fragments of telomerase or
telomerase protein subunits and washed. Bound telomerase or telomerase protein subunit is
then detected by methods well known in the art. Substantially purified telomerase or
telomerase protein subunit can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can be
used to capture the peptide and immobilize it on a solid support.

This invention also contemplates the use of competitive drug screening assays
in which neutralizing antibodies capable of binding telomerase or subunit protein(s) 
specifically compete with a test compound for binding telomerase or the subunit protein. 
Antibodies can also be used to detect the presence of any peptide which shares one or more 
antigenic determinants with the telomerase or subunit protein. 

Additional methods for identifying modulators of a telomerase activity have 
been described in U.S. Patent No. 5,645,986, which is incorporated herein by reference. It 
will be appreciated that the present invention provides improvements to previously known 
methods, in part by providing reagents such as hTRT polynucleotides, probes and primers, 
highly purified hTR, hTRT and telomerase, as well as anti-telomerase and anti-TRT 
antibodies, all of which may be used in assays, e.g., as controls, standards, binding or 
hybridization agents, or otherwise. 

It will be recognized that the recombinantly produced telomerase and TRT 
(e.g., hTRT) of the invention will be useful in assays for identification of modulators. The 
screening assay can utilize telomerase or hTRT derived by a full or partial reconstitution of 
telomerase activity, or by an augmentation of existing activity. The assay or screens provided 
by the invention can be used to test for the ability of telomerase to synthesize telomeric DNA 
or to test for any one or all of the "partial activities" of hTRT and TRTs generally, as
described above. The assay can incorporate ex vivo modification of cells which have been 
manipulated to express telomerase with or without its RNA moiety or associated proteins, 
and these can be re-implanted into an animal, which can be used for in vivo testing. Thus, 
this invention provides in vivo assays and transgenic animals useful therein. These in vivo 
assay systems can employ "knockout" cells, in which one or several units of the endogenous
telomerase enzyme complex have been deleted or inhibited, as well as cells in which an 
exogenous or endogenous telomerase activity is reconstituted or activated. 

Telomerases and TRT proteins that have been modified in a site-specific 
manner (by site-specific mutation) to modify or delete any or all functions of the telomerase 
enzyme or the TRT protein can also be employed in the screens of the invention to discover 
therapeutic agents. For example, the TRT can be engineered to lose its ability to bind 
substrate DNA, to bind its RNA moiety (e.g., hTR), to catalyze the addition of telomeric DNA,
to bind deoxynucleotide substrate, to have nucleolytic activity, to bind telomere-associated
proteins or chromosomal structures, and the like. The resulting "mutant proteins" or
"muteins" can be used to identify compounds that specifically modulate one, several, or all
functions or activities of the TRT protein or telomerase. 



C. EXEMPLARY TELOMERASE MODULATORS

1) GENERALLY

The test compounds referred to supra may be any of a large variety of
compounds, both naturally occurring and synthetic, organic and inorganic, and including
polymers (e.g., oligopeptides, polypeptides, oligonucleotides, and polynucleotides), small
molecules, antibodies (as broadly defined herein), sugars, fatty acids, nucleotides and
nucleotide analogs, analogs of naturally occurring structures (e.g., peptide mimetics, nucleic
acid analogs, and the like), and numerous other compounds.

The invention provides modulators of all types, without limitation to any
particular mechanism of action. For illustrative purposes, examples of modulators include
compounds or treatments that:

(i) bind to the hTRT polypeptide (e.g., the active site of the enzyme) or other
telomerase component, and affect a telomerase activity;

(ii) inhibit or promote association, or inhibit or promote disassociation, of a
telomerase component (e.g., hTRT or the hTRT-hTR RNP) with or from a
telomerase-associated protein (e.g., including those described in Section IV(D),
supra);

(iii) inhibit or promote association, or inhibit or promote disassociation, of
telomerase polypeptides (e.g., hTRT) with or from a telomerase RNA (e.g., hTR);

(iv) inhibit or promote association, or inhibit or promote disassociation, of
telomerase polypeptides (e.g., hTRT) with or from chromosomes (e.g., telomeres) or
chromosomal DNA (e.g., telomeric DNA);

(v) increase or decrease expression of a telomerase component gene product
(e.g., products of the hTRT gene), including change the rate or level of transcription
of the TRT gene, or translation, transport or stability of a gene product, or the like, by
binding to the gene or gene product (e.g., by interacting with a factor (e.g., a
transcription regulatory protein) that affects transcription of the hTRT gene or another
telomerase component).

2) PEPTIDE MODULATORS 

Potential modulators of telomerase activity also include peptides (e.g., 
inhibitory (antagonist) and activator (agonist) peptide modulators). For example, 
oligopeptides with randomly generated sequences can be screened to discover peptide
modulators (agonists or inhibitors) of telomerase activity. Such peptides can be used directly 
as drugs or to find the orientation or position of a functional group that can inhibit telomerase 
activity that, in turn, leads to design and testing of a small molecule inhibitor, or becomes the 
backbone for chemical modifications that increase pharmacological utility. Peptides can be
structural mimetics, and one can use molecular modeling programs to design mimetics based
on the characteristic secondary structure and/or tertiary structure of telomerase enzyme and 
hTRT protein. Such structural mimetics can also be used therapeutically, in vivo, as 
modulators of telomerase activity (agonists and antagonists). Structural mimetics can also be 
used as immunogens to elicit anti-telomerase or anti-TRT protein antibodies. 

3) INHIBITORY NATURAL COMPOUNDS AS MODULATORS OF TELOMERASE ACTIVITY

In addition, a large number of potentially useful activity-modifying 
compounds can be screened in extracts from natural products as a source material. Sources of 
such extracts can be from a large number of species of fungi, actinomyces, algae, insects,
protozoa, plants, and bacteria. Those extracts showing inhibitory activity can then be
analyzed to isolate the active molecule. See, for example, Turner (1996) J. Ethnopharmacol.
51(1-3):39-43; Suh (1995) Anticancer Res. 15:233-239.

4) INHIBITORY OLIGONUCLEOTIDES 

One particularly useful set of inhibitors provided by the present invention
includes oligonucleotides which are able to bind either mRNA encoding hTRT protein or
the hTRT gene, in either case preventing or inhibiting the production of functional hTRT
protein. Other oligonucleotides of the invention interact with telomerase's RNA moiety, such
as hTR, or are able to prevent binding of telomerase or hTRT to its DNA target, or one
telomerase component to another, or to a substrate. Such oligonucleotides can also bind the
telomerase enzyme, hTRT protein, or both protein and RNA and inhibit a partial activity as
described above (such as its processive activity, its reverse transcriptase activity, its
nucleolytic activity, and the like). The association can be through sequence specific
hybridization to another nucleic acid or by general binding, as in an aptamer, or both. 

Telomerase activity can be inhibited by targeting the hTRT mRNA with 
antisense oligonucleotides capable of binding the hTRT mRNA. 

Another useful class of inhibitors includes oligonucleotides which cause 
inactivation or cleavage of hTRT mRNA or hTR. That is, the oligonucleotide is chemically 
modified, or has enzyme activity, which causes such cleavage, such as is the case for a 
ribozyme, an EDTA-tethered oligonucleotide, or a covalently bound oligonucleotide, such as 
a psoralen or other cross-linking reagent bound oligonucleotide. As noted above, one may 
screen a pool of many different such oligonucleotides for those with the desired activity. 

Another useful class of inhibitors includes oligonucleotides which bind 
polypeptides. Double- or single-stranded DNA or double- or single-stranded RNA molecules 
that bind to specific polypeptide targets are called "aptamers." The specific
oligonucleotide-polypeptide association may be mediated by electrostatic interactions. For example, aptamers
specifically bind to anion-binding exosites on thrombin, which physiologically binds to the 
polyanionic heparin (Bock (1992) Nature 355:564-566). Because hTRT protein binds both 
hTR and its DNA substrate, and because the present invention provides hTRT and other TRT 
proteins in purified form in large quantities, those of skill in the art can readily screen for 
TRT-binding aptamers using the methods of the invention. 

Oligonucleotides (e.g., RNA oligonucleotides) that bind telomerase, hTRT, 
hTR, or portions thereof, can be generated using the techniques of SELEX (Tuerk, 1997, 
Methods Mol Biol 67, 2190). In this technique a very large pool (10^6-10^9) of random
sequence nucleic acids is bound to the target (e.g., hTRT) using conditions that cause a large
amount of discrimination between molecules with high affinity and low affinity for binding 
the target. The bound molecules are separated from unbound, and the bound molecules are 
amplified by virtue of a specific nucleic acid sequence included at their termini and suitable 
amplification reagents. This process is reiterated several times until a relatively small number 
of molecules remain that possess high binding affinity for the target. These molecules can
then be tested for their ability to modulate telomerase activity as described herein. 
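
The effect of reiterating the selection and amplification steps can be illustrated with a deliberately simplified numerical model in Python. The model is an assumption made for this sketch and is not part of the SELEX technique as described: the pool is treated as containing only two classes of molecule (high and low affinity), each class is retained with a fixed probability per round, and amplification is assumed to restore pool size without changing the ratio of the two classes. The function names and retention values are hypothetical.

```python
def selex_round(fraction_high, retain_high, retain_low):
    """Fraction of high-affinity molecules after one selection round.

    Amplification is assumed to copy both classes equally, so only the
    selection step changes the composition of the pool.
    """
    kept_high = fraction_high * retain_high
    kept_low = (1.0 - fraction_high) * retain_low
    return kept_high / (kept_high + kept_low)


def rounds_to_enrich(fraction_high, retain_high, retain_low,
                     target=0.5, max_rounds=50):
    """Repeat selection until high-affinity molecules reach `target`.

    Returns (number_of_rounds, per_round_fractions).
    """
    history = [fraction_high]
    while history[-1] < target and len(history) <= max_rounds:
        history.append(selex_round(history[-1], retain_high, retain_low))
    return len(history) - 1, history


# Hypothetical values: one binder per million molecules, with binders
# retained 50 times more efficiently than non-binders in each round.
rounds, history = rounds_to_enrich(1e-6, retain_high=0.5, retain_low=0.01)
for n, fraction in enumerate(history):
    print(f"round {n}: high-affinity fraction = {fraction:.3g}")
```

In this model each round multiplies the ratio of high-affinity to low-affinity molecules by retain_high/retain_low, which is why a small number of rounds suffices when the selection conditions discriminate strongly between the two classes.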

Antagonists of telomerase-mediated DNA replication can also be based on 
inhibition of hTR (Norton (1996) Nature Biotechnology 14:615-619) through complementary
sequence recognition or cleavage, as through ribozymes.

The inhibitory oligonucleotides of the invention can be transferred into the cell 
using a variety of techniques well known in the art. For example, oligonucleotides can be 
delivered into the cytoplasm without specific modification. Alternatively, they can be 
delivered by the use of liposomes which fuse with the cellular membrane or are endocytosed,
i.e., by employing ligands attached to the liposome or directly to the oligonucleotide, that
bind to surface membrane protein receptors of the cell resulting in endocytosis.
Alternatively, the cells may be permeabilized to enhance transport of the oligonucleotides
into the cell, without injuring the host cells. One can use a DNA binding protein, e.g.,
HBGF-1, known to transport an oligonucleotide into a cell.

5) INHIBITORY RIBOZYMES 

Ribozymes act by binding to a target RNA through the target RNA binding 
portion of a ribozyme which is held in close proximity to an enzymatic portion of the 
ribozyme that cleaves the target RNA. Thus, the ribozyme recognizes and binds a target RNA
usually through complementary base-pairing, and once bound to the correct site, acts
enzymatically to cleave and inactivate the target RNA. Cleavage of a target RNA in such a
manner will destroy its ability to direct synthesis of an encoded protein if the cleavage occurs 
in the coding sequence. After a ribozyme has bound and cleaved its RNA target, it is typically 
released from that RNA and so can bind and cleave new targets repeatedly. 
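
Recognition of a target RNA by complementary base-pairing can be sketched computationally. The following is a minimal illustration, assuming perfect antiparallel Watson-Crick pairing only (G-U wobble pairs and secondary structure are ignored); the function names and the example sequences are hypothetical and chosen only for illustration.

```python
# Watson-Crick pairing partners for RNA bases.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna):
    """Return the reverse complement of an RNA sequence written 5'->3'."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))


def find_binding_sites(target_rna, binding_arm):
    """Return 0-based start positions in target_rna (5'->3') that are
    perfectly complementary to binding_arm (also written 5'->3')."""
    site = reverse_complement(binding_arm)
    last_start = len(target_rna) - len(site)
    return [
        i for i in range(last_start + 1)
        if target_rna[i:i + len(site)] == site
    ]


if __name__ == "__main__":
    # Hypothetical target and binding arm, for illustration only.
    print(find_binding_sites("AAGGCUCAUGG", "UGAGC"))
```

A search of this kind only identifies candidate sites; whether a given site is accessible and cleaved in practice would have to be determined experimentally.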

6) IDENTIFYING TELOMERASE-ASSOCIATED PROTEINS FOR
USE AS MODULATORS

In one embodiment of the invention, telomerase is used to identify
telomerase-associated proteins, i.e., telomerase accessory proteins which modulate or otherwise
complement telomerase activity. As noted above, these proteins or fragments thereof can
modulate function by causing the dissociation or preventing the association of the telomerase
enzyme complex, preventing the assembly of the telomerase complex, preventing hTRT from 
binding to its nucleic acid complement or to its DNA template, preventing hTRT from 
binding nucleotides, or preventing, augmenting, or inhibiting any one, several or all of the 
partial activities of the telomerase enzyme or hTRT protein, as described above. 

One of skill in the art can use the methods of the invention to identify which
portions (e.g., domains) of these telomerase-associating proteins contact telomerase. In one
embodiment of the invention, these telomerase-associating proteins or fragments thereof are
used as modulators of telomerase activity. 

7) TELOMERASE-ASSOCIATED PROTEINS AS DOMINANT
NEGATIVE MUTANTS

In one embodiment of the invention, telomerase-associated proteins are used
as modulators of telomerase activity. Telomerase-associated proteins include chromosomal
structures, such as histones, nuclear matrix proteins, cell division and cell cycle control
proteins, and the like. Other telomerase-associated proteins which can be used as modulators
for the purpose of the invention include the p80 and p95 proteins and their human homologs,
such as TP1 and TRF-1 (Chong, 1995, Science 270:1663-1667). In addition, fragments of
these telomerase-associated proteins can be identified by the skilled artisan in accordance 
with the methods of the invention and used as modulators of telomerase activity. 

8) DOMINANT NEGATIVE MUTANTS 

Eight highly conserved motifs have been identified between TRTs of different
non-human species, as described above (see also Lingner (1997) Science 276:561-567).
Figure 4 shows a schematic of the human TRT amino acid sequence (from pGRN121) and
RT motifs as compared to S. pombe Trt1p, Euplotes p123 and S. cerevisiae Est2p. The
present invention provides recombinant and synthetic nucleic acids in which the codons for
the conserved amino acid residues in each, alone or in conjunction with one or more
additional codons, of all eight of these motifs have been changed to each of the other codons.
A variety of the resulting coding sequences express a non-functional hTRT. See, for
instance, Example 16. Thus, the present invention provides, for example, a wide variety of
"mutated" telomerase enzymes and TRT proteins which have a partial activity but not full
activity of telomerase. For example, one such telomerase is able to bind telomeric structures,
but not bind telomerase-associated RNA (i.e., hTR). If expressed at high enough levels, such
a telomerase mutant can deplete a necessary telomerase component (e.g., hTR) and thereby 
function as an inhibitor of wild-type telomerase activity. A mutated telomerase acting in this 
manner is an antagonist or a so-called "dominant-negative" mutant. 
9) ANTIBODIES 

In general, the antibodies of the invention can be used to identify, purify, or
inhibit any or all activity of telomerase enzyme and hTRT protein. Antibodies can act as
antagonists of telomerase activity in a variety of ways, for example, by preventing the
telomerase complex or nucleotide from binding to its DNA substrates, by preventing the 
components of telomerase from forming an active complex, by maintaining a functional 
(telomerase complex) quaternary structure or by binding to one of the enzyme's active sites or 
other sites that have allosteric effects on activity (the different partial activities of telomerase 
are described in detail elsewhere in this specification). 

D) MODULATOR SYNTHESIS 

It is contemplated that the telomerase modulators of the invention will be 
made using methods well known in the pharmaceutical arts, including combinatorial methods
and rational drug design techniques.

1) COMBINATORIAL CHEMISTRY METHODOLOGY 

The creation and simultaneous screening of large libraries of synthetic 
molecules can be carried out using well-known techniques in combinatorial chemistry, for 
example, see van Breemen (1997) Anal Chem 69:2159-2164; Lam (1997) Anticancer Drug
Des 12:145-167.

As noted above, combinatorial chemistry methodology can be used to create
vast numbers of oligonucleotides (or other compounds) that can be rapidly screened for
specific oligonucleotides (or compounds) that have appropriate binding affinities and
specificities toward any target, such as the TRT proteins of the invention (for
general background information, see Gold (1995) J. Biol. Chem. 270:13581-13584).

2) RATIONAL DRUG DESIGN 

Rational drug design involves an integrated set of methodologies that include 
structural analysis of target molecules, synthetic chemistries, and advanced computational 
tools. When used to design modulators, such as antagonists/inhibitors of protein targets, such 
as telomerase enzyme and hTRT protein, the objective of rational drug design is to 
understand a molecule's three-dimensional shape and chemistry. Rational drug design is 
aided by X-ray crystallographic data or NMR data, which can now be determined for the 
hTRT protein and telomerase enzyme in accordance with the methods and using the reagents 
provided by the invention. Calculations on electrostatics, hydrophobicities and solvent
accessibility are also helpful. See, for example, Coldren (1997) Proc. Natl. Acad. Sci. USA
94:6635-6640.

E) KITS 

The invention also provides kits that can be used to aid in determining whether
a test compound is a modulator of a TRT activity. The kit will typically include one or more
of the following components: a substantially purified TRT polypeptide or polynucleotide
(including probes and primers); a plasmid capable of expressing a TRT (e.g., hTRT) when
introduced into a cell or cell-free expression system; a plasmid capable of expressing a TR
(e.g., hTR) when introduced into a cell or cell-free expression system; cells or cell lines; a
composition to detect a change in TRT activity; and, an instructional material teaching a
means to detect and measure a change in the TRT activity, indicating that a change in the
telomerase activity in the presence of the test compound is an indicator that the test
compound modulates the telomerase activity, and one or more containers. The kit can also
include means, such as TRAP assay reagents or reagents for a quantitative polymerase chain
reaction assay, to measure a change in TRT activity. The kit may also include instructional 
material teaching a means to detect and measure a change in the TRT activity, indicating that 
a change in the telomerase activity in the presence of the test compound is an indicator that 
the test compound modulates the telomerase activity. 

XI. TRANSGENIC ORGANISMS (TELOMERASE KNOCKOUT CELLS AND 
ANIMAL MODELS) 

The invention also provides transgenic non-human multicellular organisms 
(e.g., plants and non-human animals) or unicellular organisms (e.g., yeast) comprising an 
exogenous TRT gene sequence, which may be a coding sequence or a regulatory (e.g.,
promoter) sequence. In one embodiment, the organism expresses an exogenous TRT 
polypeptide, having a sequence of a human TRT protein. In a related embodiment, the 
organism also expresses a telomerase RNA component (e.g., hTR). 

The invention also provides unicellular and multicellular organisms (or cells 
therefrom) in which at least one gene encoding a telomerase component (e.g., TRT or TR) or
telomerase-associated protein is mutated or deleted (i.e., in a coding or regulatory region)
such that native telomerase is not expressed, or is expressed at reduced levels or with
different activities when compared to wild-type cells or organisms. Such cells and organisms 
are often referred to as "gene knock-out" cells or organisms. 

The invention further provides cells and organisms in which an endogenous
telomerase gene (e.g., murine TRT) is either present or optionally mutated or deleted and an
exogenous telomerase gene or variant (e.g., human TRT) is introduced and expressed. Cells
and organisms of this type will be useful, for example, as model systems for identifying
modulators of hTRT activity or expression; determining the effects of mutations in
telomerase component genes, and other uses such as determining the developmental timing
and tissue location of telomerase activity (e.g., for assessing when to administer a telomerase
modulator and for assessing any potential side effects). 

Examples of multicellular organisms include plants, insects, and nonhuman 
animals such as mice, rats, rabbits, monkeys, apes, pigs, and other nonhuman mammals. An 
example of a unicellular organism is a yeast. 

Methods for alteration or disruption of specific genes (e.g., endogenous TRT
genes) are well known to those of skill, see, e.g., Baudin et al., 1993, Nucl Acids Res.
21:3329; Wach et al., 1994, Yeast 10:1793; Rothstein, 1991, Methods Enzymol 194:281;
Anderson, 1995, Methods Cell Biol 48:31; Pettitt et al., 1996, Development 122:4149-4157;
Ramirez-Solis et al., 1993, Methods Enzymol 225:855; and Thomas et al., 1987, Cell 51:503,
each of which is incorporated herein by reference in its entirety for all purposes.

The "knockout" cells and animals of the invention include cells and animals in 
which one or several units of the endogenous telomerase enzyme complex have been deleted 
or inhibited. Reconstitution of telomerase activity will save the cell or animal from 
senescence or, for cancer cells, cell death caused by its inability to maintain telomeres. 

Methods of altering the expression of endogenous genes are well known to those of skill in
the art. Typically, such methods involve altering or replacing all or a portion of the
regulatory sequences controlling expression of the particular gene to be regulated. The
regulatory sequences, e.g., the native promoter, can be altered. The conventional technique
for targeted mutation of genes involves placing a genomic DNA fragment containing the gene
of interest into a vector, followed by cloning of the two genomic arms associated with the
targeted gene around a selectable neomycin-resistance cassette in a vector containing
thymidine kinase. This "knock-out" construct is then transfected into the appropriate host
cell, i.e., a mouse embryonic stem (ES) cell, which is subsequently subjected to positive
selection (using G418, for example, to select for neomycin-resistance) and negative selection
(using, for example, FIAU to exclude cells that retain thymidine kinase), allowing the selection
of cells which have undergone homologous recombination with the knockout vector. This
approach leads to inactivation of the gene of interest. See, e.g., U.S. patents 5,464,764; 
5,631,153; 5,487,992; and, 5,627,059. 
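
The logic of combined positive and negative selection can be summarized in a short sketch. It assumes the usual arrangement for this kind of vector, which is an assumption made here for illustration: the thymidine kinase gene lies outside the homology arms, so it is lost upon homologous recombination and retained upon random integration, and FIAU is toxic only to cells expressing thymidine kinase. The function and dictionary names are hypothetical.

```python
def survives_selection(has_neo, has_tk):
    """Return True if a cell survives both selections.

    Positive selection: G418 kills cells lacking the neomycin-resistance
    cassette. Negative selection: FIAU kills cells expressing thymidine
    kinase (assumed to be retained only after random integration).
    """
    return has_neo and not has_tk


# Marker status for the three possible outcomes of transfection.
OUTCOMES = {
    "untransfected": {"has_neo": False, "has_tk": False},
    "random integration": {"has_neo": True, "has_tk": True},
    "homologous recombination": {"has_neo": True, "has_tk": False},
}

survivors = [
    name for name, markers in OUTCOMES.items()
    if survives_selection(**markers)
]

if __name__ == "__main__":
    print(survivors)
```

Under these assumptions only the homologous recombinants pass both selections, which is the enrichment the passage describes.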

"Knocking out" expression of an endogenous gene can also be accomplished 
by the use of homologous recombination to introduce a heterologous nucleic acid into the
regulatory sequences (e.g., promoter) of the gene of interest. To prevent expression of
functional enzyme or product, simple mutations that either alter the reading frame or disrupt
the promoter can be suitable. To up-regulate expression, a native promoter can be substituted
with a heterologous promoter that induces higher levels of transcription. Also, "gene trap
insertion" can be used to disrupt a host gene, and mouse ES cells can be used to produce
knockout transgenic animals, as described, for example, in Holzschu (1997) Transgenic Res
6:97-106.

Altering the expression of endogenous genes by homologous recombination 
can also be accomplished by using nucleic acid sequences comprising the structural gene in 
question. Upstream sequences are utilized for targeting heterologous recombination
constructs. Utilizing TRT structural gene sequence information, such as SEQUENCE ID
NO: 1, one of skill in the art can create homologous recombination constructs with only
routine experimentation. Homologous recombination to alter expression of endogenous
genes is described in U.S. Patent 5,272,071, and WO 91/09955, WO 93/09222, WO
96/29411, WO 95/31560, and WO 91/12650. Homologous recombination in mycobacteria is
described by Azad (1996) Proc. Natl Acad Sci. USA 93:4787; Baulard (1996) J.
Bacteriol 178:3091; and Pelicic (1996) Mol Microbiol 20:919. Homologous recombination
in animals has been described by Moynahan (1996) Hum. Mol Genet 5:875, and in plants by 
Offringa (1990) EMBO J. 9:3077. 



XII. GLOSSARY

The following terms are defined infra to provide additional guidance to one of
skill in the practice of the invention: adjuvant, allele (& allelic sequence), amino acids
(including hydrophobic, polar, charged), conservative substitution, control elements (& 
regulatory sequences), derivatized, detectable label, elevated level, epitope, favorable and 
unfavorable prognosis, fusion protein, gene product, hTR, immortal, immunogen and 
immunogenic, isolated, modulator, motif, nucleic acid (& polynucleotide), oligonucleotides 
(& oligomers), operably linked, polypeptide, probe (including nucleic acid probes & antibody 
probes), recombinant, selection system, sequence, specific binding, stringent hybridization 
conditions (& stringency), substantial identity (& substantial similarity), substantially pure (& 
substantially purified), telomerase-negative and telomerase-positive cells, telomerase 
catalytic activity, telomerase-related, and test compound. 

As used herein, the term "adjuvant" refers to its ordinary meaning of any 
substance that enhances the immune response to an antigen with which it is mixed. 
Adjuvants useful in the present invention include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic 
polyols, polyanions, peptides, oil emulsions, keyhole limpet hemocyanin, and dinitrophenol. 
BCG (Bacillus Calmette-Guerin) and Corynebacterium parvum are potentially useful 
adjuvants. 

As used herein, the terms "allele" or "allelic sequence" refer to an alternative 
form of a nucleic acid sequence (i.e., a nucleic acid encoding hTRT protein). Alleles result 
from mutations (i.e., changes in the nucleic acid sequence), and generally produce altered 
and/or differently regulated mRNAs or polypeptides whose structure and/or function may or 
may not be altered. Common mutational changes that give rise to alleles are generally 
ascribed to natural deletions, additions, or substitutions of nucleotides that may or may not 
affect the encoded amino acids. Each of these types of changes may occur alone, in 
combination with the others, or one or more times within a given gene, chromosome or other 
cellular nucleic acid. Any given gene may have no, one or many allelic forms. As used 
herein, the term "allele" refers to either or both a gene or an mRNA transcribed from the 
gene.

As used herein, "amino acids" are sometimes specified using the standard one
letter code: Alanine (A), Serine (S), Threonine (T), Aspartic acid (D), Glutamic acid (E),
Asparagine (N), Glutamine (Q), Arginine (R), Lysine (K), Isoleucine (I), Leucine (L),
Methionine (M), Valine (V), Phenylalanine (F), Tyrosine (Y), Tryptophan (W), Proline (P),
Glycine (G), Histidine (H), Cysteine (C). Synthetic and non-naturally occurring amino acid
analogues (and/or peptide linkages) are included. 

As used herein, "hydrophobic amino acids" refers to A, L, I, V, P, F, W, and
M. As used herein, "polar amino acids" refers to G, S, T, Y, C, N, and Q. As used herein,
"charged amino acids" refers to D, E, H, K, and R.
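
The three classes defined above partition the twenty standard amino acids, which can be checked and applied with a few lines of code. This is a minimal sketch; the names CLASSES, classify, and class_composition are illustrative only, and the example peptide is hypothetical.

```python
from collections import Counter

# Class membership exactly as defined in the text (one-letter codes).
CLASSES = {
    "hydrophobic": set("ALIVPFWM"),
    "polar": set("GSTYCNQ"),
    "charged": set("DEHKR"),
}


def classify(residue):
    """Return the class name for a one-letter amino acid code."""
    for name, members in CLASSES.items():
        if residue in members:
            return name
    raise ValueError(f"not a standard amino acid code: {residue!r}")


def class_composition(peptide):
    """Count how many residues of a peptide fall into each class."""
    return Counter(classify(residue) for residue in peptide.upper())


if __name__ == "__main__":
    print(class_composition("DEAL"))
```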

As used herein, "conservative substitution", when describing a protein, refers
to a change in the amino acid composition of the protein that does not substantially alter the
protein's activity. Thus, "conservatively modified variations" of a particular amino acid
sequence refers to amino acid substitutions of those amino acids that are not critical for
protein activity or substitution of amino acids with other amino acids having similar
properties (e.g., acidic, basic, positively or negatively charged, polar or non-polar, etc.) such
that the substitutions of even critical amino acids do not substantially alter activity.
Conservative substitution tables providing functionally similar amino acids are well known in
the art. The following six groups each contain amino acids that are conservative substitutions
for one another: 1) Alanine (A), Serine (S), Threonine (T); 2) Aspartic acid (D), Glutamic
acid (E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I),
Leucine (L), Methionine (M), Valine (V); and 6) Phenylalanine (F), Tyrosine (Y),
Tryptophan (W) (see also, Creighton (1984) Proteins, W.H. Freeman and Company). One of
skill in the art will appreciate that the above-identified substitutions are not the only possible
conservative substitutions. For example, one may regard all charged amino acids as
conservative substitutions for each other whether they are positive or negative. In addition,
individual substitutions, deletions or additions which alter, add or delete a single amino acid
or a small percentage of amino acids in an encoded sequence can also be "conservatively
modified variations". One can also make a "conservative substitution" in a recombinant
protein by utilizing one or more codons that differ from the codons employed by the native or
wild-type gene. In this instance, a conservative substitution also includes substituting a
codon for an amino acid with a different codon for the same amino acid. 
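
The six substitution groups listed above lend themselves to a simple lookup. The sketch below encodes only those six groups; as the text notes, other groupings are possible, so a result of False means only that a pair falls outside these particular groups. The names GROUPS and is_conservative are illustrative.

```python
# The six groups listed in the text, as one-letter codes.
GROUPS = ["AST", "DE", "NQ", "RK", "ILMV", "FYW"]


def is_conservative(original, replacement):
    """Return True if the two residues are identical or share one of
    the six groups listed in the text."""
    if original == replacement:
        return True
    return any(original in group and replacement in group
               for group in GROUPS)


if __name__ == "__main__":
    for pair in [("D", "E"), ("I", "V"), ("D", "K"), ("G", "A")]:
        print(pair, is_conservative(*pair))
```

Under the broader view mentioned in the text, in which all charged amino acids are regarded as interchangeable, a pair such as D and K would also count as conservative; that variant would require a different group table.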

As used herein, "control elements" or "regulatory sequences" include
enhancers, promoters, transcription terminators, origins of replication, chromosomal
integration sequences, 5' and 3' untranslated regions, with which proteins or other
biomolecules interact to carry out transcription and translation. For eukaryotic cells, the
control sequences will include a promoter and preferably an enhancer, e.g., derived from
immunoglobulin genes, SV40, cytomegalovirus, and a polyadenylation sequence, and may
include splice donor and acceptor sequences. Depending on the vector system and host
utilized, any number of suitable transcription and translation elements, including constitutive
and inducible promoters, may be used.

As used herein, a "derivatized" polynucleotide, oligonucleotide, or nucleic
acid refers to oligo- and polynucleotides that comprise a derivatized substituent. In some
embodiments, the substituent is substantially non-interfering with respect to hybridization to
complementary polynucleotides. Derivatized oligo- or polynucleotides that have been
modified with appended chemical substituents (e.g., by modification of an already
synthesized oligo- or polynucleotide, or by incorporation of a modified base or backbone
analog during synthesis) may be introduced into a metabolically active eukaryotic cell to
hybridize with an hTRT DNA, RNA, or protein where they produce an alteration or chemical
modification to a local DNA, RNA, or protein. Alternatively, the derivatized oligo- or
polynucleotides may interact with and alter hTRT polypeptides, telomerase-associated
proteins, or other factors that interact with hTRT DNA or hTRT gene products, or alter or
modulate expression or function of hTRT DNA, RNA or protein. Illustrative attached
chemical substituents include: europium (III) texaphyrin, cross-linking agents, psoralen,
metal chelates (e.g., iron/EDTA chelate for iron catalyzed cleavage), topoisomerases,
endonucleases, exonucleases, ligases, phosphodiesterases, photodynamic porphyrins,
chemotherapeutic drugs (e.g., adriamycin, doxorubicin), intercalating agents,
base-modification agents, immunoglobulin chains, and oligonucleotides. Iron/EDTA chelates are
chemical substituents often used where local cleavage of a polynucleotide sequence is desired
(Hertzberg et al., 1982, J. Am. Chem. Soc. 104:313; Hertzberg and Dervan, 1984,
Biochemistry 23:3934; Taylor et al., 1984, Tetrahedron 40:457; Dervan, 1986, Science
232:464). Illustrative attachment chemistries include: direct linkage, e.g., via an appended reactive
amino group (Corey and Schultz (1988) Science 238:1401, which is incorporated herein by
reference) and other direct linkage chemistries, although streptavidin/biotin and
digoxigenin/anti-digoxigenin antibody linkage methods can also be used. Methods for
linking chemical substituents are provided in U.S. Patents 5,135,720, 5,093,245, and 
5,055,556, which are incorporated herein by reference. Other linkage chemistries may be 
used at the discretion of the practitioner. 

As used herein, a "detectable label" has the ordinary meaning in the art and 
refers to an atom (e.g., radionuclide), molecule (e.g., fluorescein), or complex, that is or can 
be used to detect (e.g., due to a physical or chemical property), indicate the presence of a 
molecule or to enable binding of another molecule to which it is covalently bound or
otherwise associated. The term "label" also refers to covalently bound or otherwise
associated molecules (e.g., a biomolecule such as an enzyme) that act on a substrate to
produce a detectable atom, molecule or complex. Detectable labels suitable for use in the
present invention include any composition detectable by spectroscopic, photochemical,
biochemical, immunochemical, electrical, optical or chemical means. Labels useful in the
present invention include biotin for staining with labeled streptavidin conjugate, magnetic
beads (e.g., Dynabeads™), fluorescent dyes (e.g., fluorescein, Texas red, rhodamine, green
fluorescent protein, enhanced green fluorescent protein, lissamine, phycoerythrin, Cy2, Cy3,
Cy3.5, Cy5, Cy5.5, Cy7, FluorX [Amersham], SyBR Green I & II [Molecular Probes], and
the like), radiolabels (e.g., 3H, 125I, 35S, 14C, or 32P), enzymes (e.g., hydrolases, particularly
phosphatases such as alkaline phosphatase, esterases and glycosidases, or oxidoreductases,
particularly peroxidases such as horse radish peroxidase, and others commonly used in
ELISAs), substrates, cofactors, inhibitors, chemiluminescent groups, chromogenic agents,
and colorimetric labels such as colloidal gold or colored glass or plastic (e.g., polystyrene,
polypropylene, latex, etc.) beads. Patents teaching the use of such labels include U.S. Patent
Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241.
Means of detecting such labels are well known to those of skill in the art. Thus, for example,
radiolabels and chemiluminescent labels may be detected using photographic film or
scintillation counters, fluorescent markers may be detected using a photodetector to detect 
emitted light (e.g., as in fluorescence-activated cell sorting). Enzymatic labels are typically 
detected by providing the enzyme with a substrate and detecting the reaction product 
produced by the action of the enzyme on the substrate, and colorimetric labels are detected by
simply visualizing the colored label. Thus, a label is any composition detectable by
spectroscopic, photochemical, biochemical, immunochemical, electrical, optical or chemical
means. The label may be coupled directly or indirectly to the desired component of the assay
according to methods well known in the art. Non-radioactive labels are often attached by
indirect means. Generally, a ligand molecule (e.g., biotin) is covalently bound to the
molecule. The ligand then binds to an anti-ligand (e.g., streptavidin) molecule which is either
inherently detectable or covalently bound to a signal generating system, such as a detectable
enzyme, a fluorescent compound, or a chemiluminescent compound. A number of ligands
and anti-ligands can be used. Where a ligand has a natural anti-ligand, for example, biotin,
thyroxine, and cortisol, it can be used in conjunction with the labeled, naturally occurring
anti-ligands. Alternatively, any haptenic or antigenic compound can be used in combination
with an antibody. The molecules can also be conjugated directly to signal generating
compounds, e.g., by conjugation with an enzyme or fluorophore. Means of detecting labels
are well known to those of skill in the art. Thus, for example, where the label is a radioactive
label, means for detection include a scintillation counter, photographic film as in
autoradiography, or storage phosphor imaging. Where the label is a fluorescent label, it may
be detected by exciting the fluorochrome with the appropriate wavelength of light and
detecting the resulting fluorescence. The fluorescence may be detected visually, by means of
photographic film, by the use of electronic detectors such as charge coupled devices (CCDs)
or photomultipliers and the like. Similarly, enzymatic labels may be detected by providing
the appropriate substrates for the enzyme and detecting the resulting reaction product. Also,
simple colorimetric labels may be detected by observing the color associated with the label.
It will be appreciated that when pairs of fluorophores are used in an assay, it is often preferred
that they have distinct emission patterns (wavelengths) so that they can be easily
distinguished.

The phrase "elevated level" refers to an amount of hTRT gene product (or
other specified substance or activity) in a cell that is elevated or higher than the level in a
reference standard, e.g., for diagnosis, the level in normal, telomerase-negative cells in an
individual or in other individuals not suffering from the condition, and for prognosis, the
level in tumor cells from a variety of grades or classes of, e.g., tumors.

As used herein, the term "epitope" has its ordinary meaning of a site on an 
antigen recognized by an antibody. Epitopes are typically segments of amino acids which are 
a small portion of the whole protein. Epitopes may be conformational (i.e., discontinuous).
That is, they may be formed from amino acids encoded by noncontiguous parts of a primary
sequence that have been juxtaposed by protein folding. 

The terms "favorable prognosis" and "unfavorable prognosis" are known in 
the art. In the context of cancers, "favorable prognosis" means that there is a likelihood of 
tumor regression or longer survival times for patients with a favorable prognosis relative to
those with unfavorable prognosis, whereas "unfavorable prognosis" means that the tumor is 
likely to be more aggressive, i.e., grow faster and/or metastasize, resulting in a poor outcome 
or a more rapid course of disease progression for the patient. 

As used herein, the term "fusion protein" refers to a composite protein, i.e., a
single contiguous amino acid sequence, made up of two (or more) distinct, heterologous
polypeptides which are not normally fused together in a single amino acid sequence. Thus, a
fusion protein may include a single amino acid sequence that contains two entirely distinct
amino acid sequences or two similar or identical polypeptide sequences, provided that these
sequences are not normally found together in the same configuration in a single amino acid
sequence found in nature. Fusion proteins may generally be prepared using either
recombinant nucleic acid methods, i.e., as a result of transcription and translation of a
recombinant gene fusion product, which fusion comprises a segment encoding a polypeptide
of the invention and a segment encoding a heterologous protein, or by chemical synthesis
methods well known in the art. The non-hTRT region(s) of the fusion protein can be fused to
the amino terminus of the hTRT polypeptide or the carboxyl terminus, or both, or the
non-hTRT region can be inserted into the interior of the protein sequence (by inserting a moiety or
by replacing amino acids), or combinations of the foregoing can be performed.

As used herein, the term "gene product" refers to an RNA molecule 
transcribed from a gene, or a protein encoded by the gene or translated from the RNA.

As used herein, "hTR" (human telomerase RNA) refers to the RNA 
component of human telomerase and any naturally occurring alleles and variants or 
recombinant variants. hTR is described in detail in U.S. Patent No. 5,583,016 which is 
incorporated herein by reference in its entirety and for all purposes.

As used herein, the term "immortal," when referring to a cell, has its normal 
meaning in the telomerase art and refers to cells that have apparently unlimited replicative 
potential. Immortal can also refer to cells with increased proliferative capacity relative to 
their unmodified counterparts. Examples of immortal human cells are malignant tumor cells,
germ line cells, and certain transformed human cell lines cultured in vitro (e.g., cells that have 
become immortal following transformation by viral oncogenes or otherwise). In contrast, 
most normal human somatic cells are mortal, i.e., have limited replicative potential and 
become senescent after a finite number of cell divisions. 

As used herein, the terms "immunogen" and "immunogenic" have their
ordinary meaning in the art, i.e., an immunogen is a molecule, such as a protein or other
antigen, that can elicit an adaptive immune response upon injection into a person or an
animal.

As used herein, "isolated," when referring to a molecule or composition, such 
as, for example, an RNP (e.g., at least one protein and at least one RNA), means that the 
molecule or composition is separated from at least one other compound, such as a protein, 
other RNAs, or other contaminants with which it is associated in vivo or in its naturally 
occurring state. Thus, an RNP is considered isolated when the RNP has been isolated from
any other component with which it is naturally associated, e.g., cell membrane, as in a cell
extract. An isolated composition can, however, also be substantially pure.

As used herein, "modulator" refers to any synthetic or natural compound or 
composition that can change in any way either or both the "full" or any "partial activity" of a 
telomerase reverse transcriptase (TRT). A modulator can be an agonist or an antagonist. A
modulator can be any organic or inorganic compound, including, but not limited to, for
example, small molecules, peptides, proteins, sugars, nucleic acids, fatty acids and the like.

As used herein, "motif" refers to a sequence of contiguous amino acids (or to a
nucleic acid sequence that encodes a sequence of contiguous amino acids) that defines a
feature or structure in a protein that is common to or conserved in all proteins of a defined
class or type. The motif or consensus sequence may include both conserved and
non-conserved residues. The conserved residues in the motif sequence indicate that the
conserved residue or class (i.e., hydrophobic, polar, non-polar, or other class) of residues is
typically present at the indicated location in each protein (or gene or mRNA) of the class of
proteins defined by the motif. Motifs can differ in accordance with the class of proteins.
Thus, for example, the reverse transcriptase enzymes form a class of proteins that can be
defined by one or more motifs, and this class includes telomerase enzymes. However, the
telomerase enzymes can also be defined as the class of enzymes with motifs characteristic for
that class. Those of skill recognize that the identification of a residue as a conserved residue
in a motif does not mean that every member of the class defined by the motif has the 
indicated residue (or class of residues) at the indicated position, and that one or more 
members of the class may have a different residue at the conserved position. 

As used herein, the terms "nucleic acid" and "polynucleotide" are used
interchangeably. Use of the term "polynucleotide" is not intended to exclude
oligonucleotides (i.e., short polynucleotides) and can also refer to synthetic and/or non- 
naturally occurring nucleic acids (i.e., comprising nucleic acid analogues or modified 
backbone residues or linkages). 

As used herein, "oligonucleotides" or "oligomers" refer to a nucleic acid
sequence of approximately 7 nucleotides or greater, and as many as approximately 100
nucleotides, which can be used as a primer, probe or amplimer. Oligonucleotides are often
between about 10 and about 50 nucleotides in length, more often between about 14 and about
35 nucleotides, very often between about 15 and about 25 nucleotides, and the terms
oligonucleotides or oligomers can also refer to synthetic and/or non-naturally occurring
nucleic acids (i.e., comprising nucleic acid analogues or modified backbone residues or
linkages).



As used herein, the term "operably linked," refers to a functional relationship
between two or more nucleic acid (e.g., DNA) segments: for example, a promoter or enhancer
is operably linked to a coding sequence if it stimulates the transcription of the sequence in an
appropriate host cell or other expression system. Generally, sequences that are operably
linked are contiguous, and in the case of a signal sequence both contiguous and in reading
phase. However, enhancers need not be located in close proximity to the coding sequences
whose transcription they enhance.

As used herein, the term "polypeptide" is used interchangeably herein with
the term "protein," and refers to a polymer composed of amino acid residues linked by amide
linkages, including synthetic, naturally-occurring and non-naturally occurring analogs thereof
(amino acids and linkages). Peptides are examples of polypeptides.

As used herein, a "probe" refers to a molecule that specifically binds another
molecule. One example of a probe is a "nucleic acid probe" that specifically binds (i.e.,
anneals or hybridizes) to a substantially complementary nucleic acid. Another example of a
probe is an "antibody probe" that specifically binds to a corresponding antigen or epitope.

As used herein, "recombinant" refers to a polynucleotide synthesized or
otherwise manipulated in vitro (e.g., "recombinant polynucleotide"), to methods of using
recombinant polynucleotides to produce gene products in cells or other biological systems, or
to a polypeptide ("recombinant protein") encoded by a recombinant polynucleotide.

As used herein, a "selection system," in the context of stably transformed cell
lines, refers to a method for identifying and/or selecting cells containing a recombinant 
nucleic acid of interest. A large variety of selection systems are known for identification of 
transformed cells and are suitable for use with the present invention. For example, cells 
transformed by plasmids or other vectors can be selected by resistance to antibiotics conferred
by genes contained on the plasmids, such as the well known amp, gpt, neo and hyg genes, or
other genes such as the herpes simplex virus thymidine kinase (Wigler et al., Cell 11:223-32
[1977]) and adenine phosphoribosyltransferase (Lowy et al., Cell 22:817 [1980]) genes which
can be employed in tk- or aprt- cells, respectively. Also, antimetabolite, antibiotic or
herbicide resistance can be used as the basis for selection; for example, dhfr, which confers
resistance to methotrexate and is also useful for gene amplification (Wigler et al., Proc. Natl.
Acad. Sci., 77:3567 [1980]); npt, which confers resistance to the aminoglycosides neomycin
and G-418 (Colbere-Garapin et al., J. Mol. Biol., 150:1 [1981]); and als or pat, which confer
resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, in
McGraw Hill Yearbook of Science and Technology, McGraw Hill, New York NY, pp
191-196, [1992]). Additional selectable genes have been described, for example, hygromycin
resistance-conferring genes, trpB, which allows cells to utilize indole in place of tryptophan,
or hisD, which allows cells to utilize histidinol in place of histidine (Hartman and Mulligan,
Proc. Natl. Acad. Sci., 85:8047 [1988]). Recently, the use of visible markers has gained
popularity with such markers as anthocyanins, beta-glucuronidase and its substrate, GUS, and
luciferase and its substrate, luciferin, being widely used not only to identify transformants,
but also to quantify the amount of transient or stable protein expression attributable to a
specific vector system (Rhodes et al., Meth. Mol. Biol., 55:121 [1995]).

As used herein, the "sequence" of a gene (unless specifically stated
otherwise), nucleic acid, protein, or peptide refers to the order of nucleotides in either or both
strands of a double-stranded DNA molecule, e.g., the sequence of both the coding strand and 
its complement, or of a single-stranded nucleic acid molecule, or to the order of amino acids 
in a peptide or protein. 

As used herein, "specific binding" refers to the ability of one molecule,
typically an antibody or polynucleotide, to contact and associate with another specific
molecule even in the presence of many other diverse molecules. For example, a single-stranded
polynucleotide can specifically bind to a single-stranded polynucleotide that is
complementary in sequence, and an antibody specifically binds to (or "is specifically
immunoreactive with") its corresponding antigen.

As used herein, "stringent hybridization conditions" or "stringency" refers 
to conditions in a range from about 5 °C to about 20 °C or 25 °C below the melting
temperature (Tm) of the target sequence and a probe with exact or nearly exact
complementarity to the target. As used herein, the melting temperature is the temperature at
which a population of double-stranded nucleic acid molecules becomes half-dissociated into
single strands. Methods for calculating the Tm of nucleic acids are well known in the art (see,
e.g., Berger and Kimmel (1987) Methods in Enzymology, Vol. 152: Guide to
Molecular Cloning Techniques, San Diego: Academic Press, Inc. and Sambrook et al.
(1989) Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring
Harbor Laboratory (hereinafter, "Sambrook"), both incorporated herein by reference). As
indicated by standard references, a simple estimate of the Tm value may be calculated by the
equation: Tm = 81.5 + 0.41(% G + C), when a nucleic acid is in aqueous solution at 1 M
NaCl (see, e.g., Anderson and Young, Quantitative Filter Hybridization in Nucleic Acid
Hybridization (1985)). Other references include more sophisticated computations which
take structural as well as sequence characteristics into account for the calculation of Tm. The
melting temperature of a hybrid (and thus the conditions for stringent hybridization) is
affected by various factors such as the length and nature (DNA, RNA, base composition) of
the probe and nature of the target (DNA, RNA, base composition, present in solution or
immobilized, and the like), and the concentration of salts and other components (e.g., the
presence or absence of formamide, dextran sulfate, polyethylene glycol). The effects of these
factors are well known and are discussed in standard references in the art, e.g., Sambrook,
supra, and Ausubel et al., supra. Typically, stringent hybridization conditions are salt
concentrations less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion at
pH 7.0 to 8.3, and temperatures at least about 30°C for short probes (e.g., 10 to 50
nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). As
noted, stringent conditions may also be achieved with the addition of destabilizing agents
such as formamide, in which case lower temperatures may be employed.
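
The simple Tm estimate and the stringency range given above can be illustrated with a short calculation. The following Python sketch is offered only as an illustration; the function names and the example probe sequence are hypothetical and do not appear in the cited references. It applies the equation Tm = 81.5 + 0.41(% G + C), which by its terms applies to a nucleic acid in aqueous solution at 1 M NaCl, and reports the range from about 25 °C to about 5 °C below that estimate.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleic acid sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence must not be empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def estimate_tm(seq: str) -> float:
    """Simple estimate: Tm = 81.5 + 0.41 * (% G + C), in degrees C.

    Assumes aqueous solution at 1 M NaCl; length, formamide and other
    factors discussed in the text are deliberately ignored here.
    """
    return 81.5 + 0.41 * gc_percent(seq)


def stringent_range(tm: float, low_offset: float = 5.0,
                    high_offset: float = 25.0) -> tuple:
    """Return (lowest, highest) temperature of the stringent range,
    i.e., from high_offset below Tm up to low_offset below Tm."""
    return (tm - high_offset, tm - low_offset)


if __name__ == "__main__":
    probe = "ATGCGCGCTAGCTAGGCTAACGT"  # hypothetical probe sequence
    tm = estimate_tm(probe)
    low, high = stringent_range(tm)
    print(f"%GC = {gc_percent(probe):.1f}, Tm estimate = {tm:.1f} C")
    print(f"stringent range: about {low:.1f} C to about {high:.1f} C")
```

Because the estimate ignores probe length and destabilizing agents, it is best read as a starting point to be refined by the more sophisticated computations mentioned above.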

As used herein, the term "substantial identity," "substantial sequence 
identity," or "substantial similarity" in the context of nucleic acids, refers to a measure of
sequence similarity between two polynucleotides. Substantial sequence identity can be 
determined by hybridization under stringent conditions, by direct comparison, or other 
means. For example, two polynucleotides can be identified as having substantial sequence 
identity if they are capable of specifically hybridizing to each other under stringent
hybridization conditions. Other degrees of sequence identity (e.g., less than "substantial")
can be characterized by hybridization under different conditions of stringency. Alternatively,
substantial sequence identity can be described as a percentage identity between two
nucleotide (or polypeptide) sequences. Two sequences are considered substantially identical
when they are at least about 60% identical, preferably at least about 70% identical, or at least
about 80% identical, or at least about 90% identical, or at least about 95% or 98% to 100%
identical. Percentage sequence (nucleotide or amino acid) identity is typically calculated by 
determining the optimal alignment between two sequences and comparing the two sequences. 
For example, an exogenous transcript used for protein expression can be described as having a
certain percentage of identity or similarity compared to a reference sequence (e.g., the
corresponding endogenous sequence). Optimal alignment of sequences may be conducted
using the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2: 482,
by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48:
443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci.
U.S.A. 85: 2444, by computerized implementations of these algorithms (GAP, BESTFIT,
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer
Group, 575 Science Dr., Madison, WI), or by inspection. The best alignment (i.e., resulting
in the highest percentage of identity) generated by the various methods is selected. Typically 
these algorithms compare the two sequences over a "comparison window" (usually at least 18 
nucleotides in length) to identify and compare local regions of sequence similarity, thus
allowing for small additions or deletions (i.e., gaps). Additions and deletions are typically 20
percent or less of the length of the sequence relative to the reference sequence, which does
not comprise additions or deletions. It is sometimes desirable to describe sequence identity
between two sequences in reference to a particular length or region (e.g., two sequences may
be described as having at least 95% identity over a length of at least 500 basepairs). Usually
the length will be at least about 50, 100, 200, 300, 400 or 500 basepairs, amino acids, or other
residues. The percentage of sequence identity is calculated by comparing two optimally
aligned sequences over the region of comparison, determining the number of positions at 
which the identical nucleic acid base (e.g., A, T, C, G, or U) occurs in both sequences to yield 
the number of matched positions, and determining the number (or percentage) of matched 
positions as compared to the total number of bases in the reference sequence or region of
comparison. An additional algorithm that is suitable for determining sequence similarity is
the BLAST algorithm, which is described in Altschul (1990) J. Mol. Biol. 215: 403-410; and
Shpaer (1996) Genomics 38:179-191. Software for performing BLAST analyses is publicly
available at the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). This algorithm involves first identifying high scoring
sequence pairs (HSPs) by identifying short words of length W in the query sequence that
either match or satisfy some positive-valued threshold score T when aligned with a word of
the same length in a database sequence. T is referred to as the neighborhood word score
threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds for
initiating searches to find longer HSPs containing them. The word hits are extended in both
directions along each sequence for as far as the cumulative alignment score can be increased.
Extension of the word hits in each direction is halted when: the cumulative alignment score
falls off by the quantity X from its maximum achieved value; the cumulative score goes to
zero or below, due to the accumulation of one or more negative-scoring residue alignments;
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X
determine the sensitivity and speed of the alignment. The BLAST program uses as defaults a
wordlength (W) of 11, the BLOSUM62 scoring matrix (see Henikoff (1992) Proc. Natl.
Acad. Sci. USA 89: 10915-10919), alignments (B) of 50, expectation (E) of 10, M=5, N=-4,
and a comparison of both strands. The term BLAST refers to the BLAST algorithm which
performs a statistical analysis of the similarity between two sequences; see, e.g., Karlin
(1993) Proc. Natl. Acad. Sci. USA 90:5873-5787. One measure of similarity provided by the
BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the
probability by which a match between two nucleotide or amino acid sequences would occur
by chance. For example, a nucleic acid can be considered similar to a TRT nucleic acid if the
smallest sum probability in a comparison of the test nucleic acid to a TRT nucleic acid is
less than about 0.5, 0.2, 0.1, 0.01, or 0.001. Alternatively, another indication that two nucleic
acid sequences are similar is that the polypeptide which the first nucleic acid encodes is
immunologically cross reactive with the polypeptide encoded by the second nucleic acid. It
will be recognized that homologous non-human TRT polynucleotides may have less than
"substantial" nucleotide identity in certain regions, as the term "substantial identity" is
defined herein. For example, Euplotes TRT is substantially less than about 60% identical to
the hTRT polynucleotide of Seq. ID. No. 1 in certain regions, although the two genes are
homologs.
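
The percentage identity calculation described above (matched positions divided by the total number of bases in the reference sequence or region of comparison) can be sketched as follows. This Python example is illustrative only: it assumes the two sequences have already been optimally aligned by one of the cited methods and are supplied as equal-length strings with "-" marking gaps. The function names, the gap symbol, and the example sequences are assumptions made for this illustration.

```python
def percent_identity(aligned_test: str, aligned_ref: str, gap: str = "-") -> float:
    """Percent identity of a pre-aligned test sequence against a reference.

    Matched positions are those where both sequences carry the same
    non-gap residue; the denominator is the number of residues in the
    reference (gap characters in the reference row are not counted).
    """
    if len(aligned_test) != len(aligned_ref):
        raise ValueError("aligned sequences must have equal length")
    test, ref = aligned_test.upper(), aligned_ref.upper()
    matches = sum(1 for t, r in zip(test, ref) if t == r and t != gap)
    ref_length = sum(1 for r in ref if r != gap)
    if ref_length == 0:
        raise ValueError("reference contains no residues")
    return 100.0 * matches / ref_length


def is_substantially_identical(aligned_test: str, aligned_ref: str,
                               threshold: float = 60.0) -> bool:
    """Apply a percentage threshold (about 60% is the lowest figure
    given in the text; 70%, 80%, 90% and higher are also recited)."""
    return percent_identity(aligned_test, aligned_ref) >= threshold


if __name__ == "__main__":
    ref = "ACGTTAGC"    # hypothetical reference region
    test = "ACGT-AGG"   # hypothetical aligned test sequence with one gap
    print(f"{percent_identity(test, ref):.1f}% identical")
    print(is_substantially_identical(test, ref))
```

The sketch does not perform the alignment itself; in practice the aligned strings would come from an implementation of the Smith and Waterman or Needleman and Wunsch algorithms, or from BLAST output.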

As used herein, the terms "substantial identity," "substantial sequence 
identity," or "substantial similarity" in the context of a polypeptide, refers to a degree of
similarity between two polypeptides in which a polypeptide comprises a sequence with at
least 70% sequence identity to a reference sequence, or 80%, or 85% or up to 100% sequence
identity to the reference sequence, or most preferably 90% identity over a comparison
window of about 10-20 amino acid residues. Amino acid sequence similarity, or sequence
identity, is determined by optimizing residue matches, if necessary, by introducing gaps as
required. See Needleman et al. (1970) J. Mol. Biol. 48: 443-453; and Sankoff et al., 1983,
Time Warps, String Edits, and Macromolecules, The Theory and Practice of Sequence
Comparison, Chapter One, Addison-Wesley, Reading, MA; and software packages from
IntelliGenetics, Mountain View, CA, and the University of Wisconsin Genetics Computer
Group, Madison, WI. As will be apparent to one of skill, the terms "substantial identity",
"substantial similarity" and "substantial sequence identity" can be used interchangeably with
regard to polypeptides or polynucleotides. It will be recognized that homologous non-human
TRT polypeptides may have less than "substantial" sequence identity in certain regions, as the
term "substantial identity" is defined herein. For example, Euplotes TRT protein is
substantially less than about 60% identical to the hTRT polynucleotide of Seq. ID. No. 2 in
certain regions, although the two genes are homologs. In the context of TRT polypeptides
from different species, for example, "significant homology" at the amino acid sequence
means at least about 20% sequence identity in a region of about 20 to about 40 residues, or at
least about 40% sequence identity in region of at least about 20% sequence identity. 

As used herein, the term "substantially pure," or "substantially purified,"
when referring to a composition comprising a specified reagent, such as an antibody (e.g., an
anti-hTRT antibody), means that the specified reagent is at least about 75%, or at least about 
90%, or at least about 95%, or at least about 99% or more of the composition (not including, 
e.g., solvent or buffer). Thus, for example, a preferred immunoglobulin preparation of the 
invention that specifically binds an hTRT polypeptide is substantially purified. 

As used herein, a "telomerase negative" cell is one in which telomerase is not 
expressed, i.e., no telomerase catalytic activity can be detected using a conventional assay or 
a TRAP assay for telomerase catalytic activity. As used herein, a "telomerase positive" cell 
is a cell in which telomerase is expressed (i.e. telomerase activity can be detected). 

As used herein, a "telomerase-related" disease or condition is a disease or 
condition in a subject that is correlated with an abnormally high level of telomerase activity 
in cells of the individual, which can include any telomerase activity at all for most normal 
somatic cells, or which is correlated with a low level of telomerase activity that results in 
impairment of a normal cell function. Examples of telomerase-related conditions include,
e.g., cancer (high telomerase activity in malignant cells) and infertility (low telomerase 
activity in germ-line cells). 

As used herein, "test compound" or "agent" refers to any synthetic or natural 
compound or composition. The term includes all organic and inorganic compounds,
including, for example, small molecules, peptides, proteins, sugars, nucleic acids, fatty acids
and the like. 



XIII. EXAMPLES

The following examples are provided to illustrate the present invention, and
not by way of limitation.

In the following sections, the following abbreviations apply: eq (equivalents);
M (Molar); uM (micromolar); N (Normal); mol (moles); mmol (millimoles); umol
(micromoles); nmol (nanomoles); g (grams); mg (milligrams); ug (micrograms); ng
(nanograms); l or L (liters); ml (milliliters); ul (microliters); cm (centimeters); mm
(millimeters); um (micrometers); nm (nanometers); °C (degrees Centigrade); RNP
(ribonucleoprotein); mreN (2'-O-methylribonucleotides); dNTP (deoxyribonucleotide); dH2O
(distilled water); DTT (dithiothreitol); PMSF (phenylmethylsulfonyl fluoride); TE (10 mM
Tris-HCl, 1 mM EDTA, approximately pH 7.2); KGlu (potassium glutamate); SSC (salt and
sodium citrate buffer); SDS (sodium dodecyl sulfate); PAGE (polyacrylamide gel 
electrophoresis); Novex (Novex, San Diego, CA); BioRad (Bio-Rad Laboratories, Hercules, 
CA); Pharmacia (Pharmacia Biotech, Piscataway, NJ); Boehringer-Mannheim (Boehringer- 
Mannheim Corp., Concord, CA); Amersham (Amersham, Inc., Chicago, IL); Stratagene 
(Stratagene Cloning Systems, La Jolla, CA); NEB (New England Biolabs, Beverly, MA); 
Pierce (Pierce Chemical Co., Rockford, IL); Beckman (Beckman Instruments, Fullerton, 
CA); Lab Industries (Lab Industries, Inc., Berkeley, CA); Eppendorf (Eppendorf Scientific, 
Madison, WI); and Molecular Dynamics (Molecular Dynamics, Sunnyvale, CA). 

EXAMPLE 1 

ISOLATION OF TELOMERASE PROTEINS AND CLONES 

The following example details the isolation of telomerase proteins and clones 
from various organisms, including the Euplotes p123, hTRT, TRT and S. pombe TRT
telomerase cDNA clones. 
A. Background 

i) Introduction 

This section provides an overview of the purification and cloning of TRT 
genes, which is described in greater detail in subsequent sections of this Example. While 
telomerase RNA subunits have been identified in ciliates, yeast and mammals, protein 
subunits of the enzyme have not been identified as such prior to the present invention. 
Purification of telomerase from the ciliated protozoan Euplotes aediculatus yielded two 
proteins, termed p123 and p43 (see infra; Lingner (1996) Proc. Natl. Acad. Sci. U.S.A.
93:10712). Euplotes aediculatus is a hypotrichous ciliate having a macronucleus containing
about 8 x 10^7 telomeres and about 3 x 10^5 molecules of telomerase. After purification, the
active telomerase complex had a molecular mass of about 230 kD, corresponding to a 66 kD
RNA subunit and two proteins of about 123 kD and 43 kD (Lingner (1996) supra).
Photocross-linking experiments indicated that the larger p123 protein was involved in
specific binding of the telomeric DNA substrate (Lingner (1996) supra).

The p123 and p43 proteins were sequenced and the cDNA clones which
encoded these proteins were isolated. These Euplotes sequences were found to be unrelated to
the Tetrahymena telomerase-associated proteins p80 and p95. Sequence analysis of the
Euplotes p123 revealed reverse transcriptase (RT) motifs. Furthermore, sequence analysis of
the Euplotes p123 by comparison to other sequences revealed a yeast homolog, termed Est2
protein (Lingner (1997) Science 276:561). Yeast Est2 had previously been shown to be
essential for telomere maintenance in vivo (Lendvay (1996) Genetics 144:1399) but had not
been identified as a telomerase catalytic protein. Site-specific mutagenesis demonstrated that
the RT motifs of yeast Est2 are essential for telomeric DNA synthesis in vivo and in vitro
(Lingner (1997) supra).

ii) Identifying and Characterizing S. pombe Telomerase 

PCR amplification of S. pombe DNA was carried out with degenerate 
sequence primers designed from the Euplotes p123 RT motifs as described below. Of the
four prominent PCR products generated, a 120 base pair band encoded a peptide sequence
homologous to p123 and Est2. This PCR product was used as a probe in colony
hybridization and identified two overlapping clones from an S. pombe genomic library and
three from an S. pombe cDNA library. Sequence analysis revealed that none of the three S. 
pombe cDNA clones was full length, so RT-PCR was used to obtain the sequences encoding 
the protein's N-terminus. 

Complete sequencing of these clones revealed a putative S. pombe telomerase
RT gene, trt1. The complete nucleotide sequence of trt1 has been deposited in GenBank,
accession number AF015783 (see Figure 15).

Analysis of the sequence showed that trt1 encoded a basic protein with a predicted
molecular mass of 116 kD. It was found that homology with p123 and Est2 was especially
high in the seven reverse transcriptase motifs, underlined and designated as motifs 1, 2, A, B,
C, D, and E (see Figure 63). An additional telomerase-specific motif, designated the T-motif,
was also found. Fifteen introns, ranging in size from 36 to 71 base pairs, interrupted
the coding sequence.

To test S. pombe trt1 as a catalytic subunit, two deletion constructs were
created. One removed only motifs B through D in the RT domains. The second removed
99% of the open reading frame.

Haploid cells grown from S. pombe spores of both mutants showed 
progressive telomere shortening to the point where hybridization to telomeric repeats became 
almost undetectable. A trt1+/trt1- diploid was sporulated and the resulting tetrads were
dissected and germinated on a yeast extract medium supplemented with amino acids (a YES
plate, Alfa (1993) Experiments with Fission Yeast, Cold Spring Harbor Laboratory Press,
Cold Spring Harbor, NY). Colonies derived from each spore were grown at 32°C for three
days, and streaked successively to fresh YES plates every three days. A colony from each
round was placed in six ml of YES liquid culture at 32°C and grown to stationary phase.
Genomic DNA was prepared. After digestion with ApaI, DNA was subjected to
electrophoresis on a 2.3% agarose gel, stained with ethidium bromide to confirm 
approximately equal loading in each lane, then transferred to a nylon membrane and 
hybridized to a telomeric DNA probe. 

Senescence was indicated by the delayed onset of growth or failure to grow on
agar (typically at the fourth streak-out after germination) and by colonies with increasingly
ragged edges (colony morphology shown in Figure 22C) and by increasingly high fractions of
elongated cells (as shown in Figure 22D). Cells were plated on Minimal Medium (Alfa 
(1993) supra) with glutamic acid substituted for ammonium chloride for two days at 32°C 
prior to photography. 

When individual enlarged cells were separated on the dissecting microscope,
the majority were found to undergo no further division. The same telomerase negative (trt1-)
cell population always contained normal-sized cells which continued to divide, but which
frequently produced non-dividing progeny. The telomerase-negative survivors may use a
recombinational mode of telomere maintenance as documented in budding yeast strains that
have various telomere-replication genes deleted (Lendvay (1996) supra, Lundblad (1993)
Cell 73:347).

iii) Identifying and Characterizing Human Telomerase

An EST (expressed sequence tag) derived from human telomerase reverse 
transcriptase (hTRT) cDNA was identified by a BLAST search of the dbEST (expressed 
sequence tag) Genbank database using the Euplotes 123 kDa peptide and nucleic acid 
sequences, as well as the Schizosaccharomyces protein and corresponding cDNA (tez1)
sequences. The EST, designated Genbank AA28196, is 389 nucleotides long and
corresponds to positions 1679 to 2076 of clone 712562 (Figure 18), which was obtained from the
I.M.A.G.E. Consortium (Human Genome Center, DOE, Lawrence Livermore National
Laboratory, Livermore, CA). This clone was obtained from a cDNA library of germinal B
cells derived by flow sorting of tonsil cells. Complete sequencing of this hTRT cDNA clone
showed all eight telomerase RT (TRT) motifs. However, this hTRT clone did not encode a
contiguous portion of a TRT because RT motifs B', C, D, and E were contained in a different
open reading frame than the more N-terminal RT motifs. In addition, the distance between
RT motifs A and B was substantially shorter than that of the three previously known (non-human)
TRTs.

To isolate a full length cDNA clone, a cDNA library derived from the human
293 cell line (described above), which expresses high levels of telomerase activity, was
screened. A lambda cDNA library from the 293 cell line was partitioned into 25 pools
containing about 200,000 plaques each. Each pool was screened by PCR with the primer pair
5'-CGGAAGAGTGTCTGGAGCAA-3' and 5'-GGATGAAGCGGAGTCTGGA-3'. Six
subpools of one positive primary pool were further screened by PCR using this same primer
pair. For both the primary and the secondary subpool screening, hTRT was amplified for a
total of 31 cycles at: 94°C, 45 seconds; 60°C, 45 seconds; and 72°C, 90 seconds. As a
control, RNA of the house-keeping enzyme GAPDH was amplified using the primer pair
5'-CTCAGACACCATGGGGAAGGTGA-3' and 5'-ATGATCTTGAGGCTGTTGTCATA-3' for a total of 16
cycles at 94°C, 45 seconds; 55°C, 45 seconds; and 72°C, 90 seconds.
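
The screening arithmetic implied by the preceding paragraph can be tabulated as follows. This Python sketch is illustrative only; the class and variable names are hypothetical, and the times computed are programmed hold times that ignore temperature ramping and any initial denaturation or final extension steps, none of which are stated in the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: int    # hold temperature in degrees C
    seconds: int   # hold time in seconds


@dataclass(frozen=True)
class PcrProgram:
    name: str
    cycles: int
    steps: tuple

    def total_seconds(self) -> int:
        """Programmed hold time over all cycles (ramp time excluded)."""
        return self.cycles * sum(step.seconds for step in self.steps)


# Cycling conditions as recited in the text.
HTRT = PcrProgram("hTRT", 31, (Step(94, 45), Step(60, 45), Step(72, 90)))
GAPDH = PcrProgram("GAPDH", 16, (Step(94, 45), Step(55, 45), Step(72, 90)))

# Library partitioning as recited in the text.
POOLS = 25
PLAQUES_PER_POOL = 200_000  # "about 200,000 plaques each"

if __name__ == "__main__":
    for program in (HTRT, GAPDH):
        minutes = program.total_seconds() / 60
        print(f"{program.name}: {program.cycles} cycles, {minutes:.0f} min hold time")
    print(f"approximate plaques screened: {POOLS * PLAQUES_PER_POOL:,}")
```

Under these assumptions the primary screen covers roughly five million plaques, which gives a sense of why pooling and PCR prescreening preceded plaque hybridization.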

One hTRT positive subpool from the secondary screening was then screened 
by plaque hybridization with a probe from the 5' region of clone #712562. One phage was
positively identified (designated Lambda phage 25-1.1, ATCC 209024, deposited May 12,
1997). It contained an approximately four kilobase insert, which was excised and subcloned
into the EcoRI site of pBluescript II SK+ vector (Stratagene, San Diego, CA) as an EcoRI
fragment. This cDNA clone-containing plasmid was designated pGRN121. The cDNA
insert totals approximately 4 kilobasepairs. The complete nucleotide sequence of the human
hTRT cDNA (pGRN121) has been deposited in Genbank (accession AF015950) and the
plasmid has been deposited with the ATCC (ATCC 209016, deposited May 6, 1997).

B. Growth of Euplotes aediculatus 

In this Example, cultures of E. aediculatus were obtained from Dr. David 
Prescott, MCDB, University of Colorado. Dr. Prescott originally isolated this culture from 
pond water, although this organism is also available from the ATCC (ATCC #30859).
Cultures were grown as described by Swanton et al. (Swanton et al., Chromosoma 77:203
[1980]), under non-sterile conditions, in 15-liter glass containers containing Chlorogonium as
a food source. Organisms were harvested from the cultures when the density reached
approximately 10^4 cells/ml.

15 

C. Preparation of Nuclear Extracts 

In this Example, nuclear extracts of E. aediculatus were prepared using the
method of Lingner et al. (Lingner et al., Genes Develop., 8:1984 [1994]), with minor
modifications, as indicated below. Briefly, cells grown as described in Part B were
concentrated with 15 µm Nytex filters and cooled on ice. The cell pellet was resuspended in
a final volume of 110 ml TMS/PMSF/spermidine phosphate buffer. The stock
TMS/PMSF/spermidine phosphate buffer was prepared by adding 0.075 g spermidine
phosphate (USB) and 0.75 ml PMSF (from 100 mM stock prepared in ethanol) to 150 ml
TMS. TMS comprised 10 mM Tris-acetate, 10 mM MgCl2, 85.5752 g sucrose/liter, and
0.33297 g CaCl2/liter, pH 7.5.
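The mass-per-liter figures in the TMS recipe can be converted to molar concentrations as a cross-check. The sketch below assumes standard molar masses (342.30 g/mol for sucrose and 110.98 g/mol for anhydrous CaCl2); neither value is stated in the text, and a hydrated calcium chloride salt would give a different result.

```python
# Molar masses are assumptions (standard values, anhydrous CaCl2);
# the recipe itself gives only grams per liter.
MOLAR_MASS_G_PER_MOL = {"sucrose": 342.30, "CaCl2": 110.98}
GRAMS_PER_LITER = {"sucrose": 85.5752, "CaCl2": 0.33297}

def molarity(component):
    """Return the concentration in mol/liter for a TMS component."""
    return GRAMS_PER_LITER[component] / MOLAR_MASS_G_PER_MOL[component]

for name in GRAMS_PER_LITER:
    print(f"{name}: {molarity(name) * 1000:.1f} mM")
```

Under these assumptions the recipe corresponds to roughly 0.25 M sucrose and 3 mM CaCl2.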

After resuspension in TMS/PMSF/spermidine phosphate buffer, 8.8 ml 10%
NP-40 and 94.1 g sucrose were added and the mixture placed in a siliconized glass beaker
with a stainless steel stirring rod attached to an overhead motor. The mixture was stirred until
the cells were completely lysed (approximately 20 minutes). The mixture was then
centrifuged for 10 minutes at 7500 rpm (8950 x g), at 4°C, using a Beckman JS-13 swing-out
rotor. The supernatant was removed and the nuclei pellet was resuspended in
TMS/PMSF/spermidine phosphate buffer, and centrifuged again, for 5 minutes at 7500 rpm
(8950 x g), at 4°C, using a Beckman JS-13 swing-out rotor.

The supernatant was removed and the nuclei pellet was resuspended in a
buffer comprised of 50 mM Tris-acetate, 10 mM MgCl2, 10% glycerol, 0.1% NP-40, 0.4 M
KGlu, 0.5 mM PMSF, pH 7.5, at a volume of 0.5 ml buffer per 10 g of harvested cells. The
resuspended nuclei were then dounced in a glass homogenizer with approximately 50 strokes,
and then centrifuged for 25 minutes at 14,000 rpm at 4°C, in an Eppendorf centrifuge. The
supernatant containing the nuclear extract was collected, frozen in liquid nitrogen, and stored
at -80°C until used.

D. Purification of Telomerase 

In this Example, nuclear extracts prepared as described in Part C were used to purify
E. aediculatus telomerase. In this purification protocol, telomerase was first enriched by
chromatography on an Affi-Gel-heparin column, and then extensively purified by affinity
purification with an antisense oligonucleotide. As the template region of telomerase RNA is
accessible to hybridization in the telomerase RNP particle, an antisense oligonucleotide (i.e.,
the "affinity oligonucleotide") was synthesized that was complementary to this template
region as an affinity bait for the telomerase. A biotin residue was included at the 5' end of the
oligonucleotide to immobilize it to an avidin column.

Following the binding of the telomerase to the oligonucleotide, and extensive
washing, the telomerase was eluted by use of a displacement oligonucleotide. The affinity
oligonucleotide included DNA bases that were not complementary to the telomerase RNA 5'
to the telomerase-specific sequence. As the displacement oligonucleotide was
complementary to the affinity oligonucleotide for its entire length, it was able to form a more
thermodynamically stable duplex than the telomerase bound to the affinity oligonucleotide.
Thus, addition of the displacement oligonucleotide resulted in the elution of the telomerase
from the column.

The nuclear extracts prepared from 45 liter cultures were frozen until a total of
34 ml of nuclear extract was collected. This corresponded to 630 liters of culture (i.e.,
approximately 4 x 10^9 cells). The nuclear extract was diluted with a buffer to 410 ml, to
provide final concentrations of 20 mM Tris-acetate, 1 mM MgCl2, 0.1 mM EDTA, 33 mM
KGlu, 10% (vol/vol) glycerol, 1 mM dithiothreitol (DTT), and 0.5 mM phenylmethylsulfonyl
fluoride (PMSF), at a pH of 7.5.
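The 33 mM KGlu figure is consistent with a simple dilution of extract prepared in the 0.4 M KGlu buffer of Part C. The following sketch checks that arithmetic; it assumes the diluent itself contained no KGlu, which the text does not state.

```python
def diluted_concentration(stock_mM, stock_volume_ml, final_volume_ml):
    """C1 * V1 = C2 * V2, solved for C2."""
    return stock_mM * stock_volume_ml / final_volume_ml

# 34 ml of extract at 400 mM KGlu brought to 410 ml
# (assumption: the dilution buffer was KGlu-free).
kglu_mM = diluted_concentration(400.0, 34.0, 410.0)
print(f"{kglu_mM:.1f} mM KGlu")
```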

The diluted nuclear extract was applied to an Affi-Gel-heparin gel column
(Bio-Rad), with a 230 ml bed volume and 5 cm diameter, equilibrated in the same buffer and
eluted with a 2-liter gradient from 33 to 450 mM KGlu. The column was run at 4°C, at a
flow rate of 1 column volume/hour. Fractions of 50 ml each were collected and assayed for
telomerase activity as described in Part E. Telomerase was eluted from the column at
approximately 170 mM KGlu. Fractions containing telomerase (approximately 440 ml) were
pooled and adjusted to 20 mM Tris-acetate, 10 mM MgCl2, 1 mM EDTA, 300 mM KGlu,
10% glycerol, 1 mM DTT, and 1% Nonidet P-40. This buffer was designated as "WB."

To this preparation, 1.5 nmol of each of two competitor DNA oligonucleotides
(5'-TAGACCTGTTAGTGTACATTTGAATTGAAGC-3' and
5'-TAGACCTGTTAGGTTGGATTTGTGGCATCA-3'), 50 µg yeast RNA (Sigma), and 0.3
nmol of biotin-labeled telomerase-specific oligonucleotide
(5'-biotin-TAGACCTGTTA-(rmeG)2-(rmeU)4-(rmeG)4-(rmeU)4-rmeG-3') were added per ml of the pool. The
2'-O-methylribonucleotides of the telomerase-specific oligonucleotide were complementary to the
telomerase RNA template region; the deoxyribonucleotides were not complementary.
The inclusion of competitor, non-specific DNA oligonucleotides increased the efficiency of
the purification, as the effects of nucleic acid binding proteins and other components in the
mixture that would either bind to the affinity oligonucleotide or remove the telomerase from
the mixture were minimized.

This material was then added to Ultralink immobilized neutravidin plus
(Pierce) column material, at a volume of 60 µl of suspension per ml of pool. The column
material was pre-blocked twice for 15 minutes each blocking, with a preparation of WB
containing 0.01% Nonidet P-40, 0.5 mg BSA, 0.5 mg/ml lysozyme, 0.05 mg/ml glycogen,
and 0.1 mg/ml yeast RNA. The blocking was conducted at 4°C, using a rotating wheel to
block the column material thoroughly. After the first blocking step, and before the second
blocking step, the column material was centrifuged at 200 x g for 2 minutes to pellet the
matrix.

The pool-column mixture was incubated for 8 minutes at 30°C, and then for
an additional 2 hours at 4°C, on a rotating wheel (approximately 10 rpm; Labindustries) to
allow binding. The pool-column mixture was then centrifuged at 200 x g for 2 minutes, and the
supernatant containing unbound material was removed. The pool-column mixture was then
washed. This washing process included the steps of rinsing the pool-column mixture with
WB at 4°C, washing the mixture for 15 minutes with WB at 4°C, rinsing with WB, washing
for 5 minutes at 30°C, with WB containing 0.6 M KGlu, and no Nonidet P-40, washing 5
minutes at 25°C with WB, and finally, rinsing again with WB. The volume remaining after
the final wash was kept small, in order to yield a ratio of buffer to column material of
approximately 1:1.

Telomerase was eluted from the column material by adding 1 nmol of
displacement deoxyoligonucleotide (5'-CA4C4A4C2TA2CAG2TCTA-3') per ml of column
material and incubating at 25°C for 30 minutes. The material was centrifuged for 2 minutes
at 14,000 rpm in a microcentrifuge (Eppendorf), and the eluate collected. The elution
procedure was repeated twice more, using fresh displacement oligonucleotide each time. As
mentioned above, because the displacement oligonucleotide was complementary to the
affinity oligonucleotide, it formed a more thermodynamically stable complex with the affinity
oligonucleotide than did the telomerase. Thus, addition of the displacement oligonucleotide to
affinity-bound telomerase resulted in efficient elution of telomerase under native conditions. The
telomerase appeared to be approximately 50% pure at this stage, as judged by analysis on a
protein gel. The affinity purification of telomerase and elution with a displacement
oligonucleotide is shown in Figure 26 (panels A and B, respectively). In this Figure, the
2'-O-methyl sugars of the affinity oligonucleotide are indicated by the bold line. The black and
shaded oval shapes in this Figure are intended to represent graphically the protein subunits of
the present invention.
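The statement that the displacement oligonucleotide is complementary to the affinity oligonucleotide over its entire length can be checked directly. The sketch below is illustrative only: it expands the shorthand for both oligonucleotides as read here, writes the 2'-O-methyl ribonucleotides as DNA letters (U as T), and ignores the biotin; the function name is hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]

# Affinity oligonucleotide: DNA portion followed by the 2'-O-methyl RNA
# portion (G2 U4 G4 U4 G), with U written as T for comparison purposes.
affinity = "TAGACCTGTTA" + "GG" + "TTTT" + "GGGG" + "TTTT" + "G"

# Displacement oligonucleotide: C A4 C4 A4 C2 T A2 C A G2 T C T A
displacement = "C" + "A" * 4 + "C" * 4 + "A" * 4 + "CC" + "T" + "AA" + "CAGG" + "TCTA"

print(len(affinity), len(displacement))
print(reverse_complement(displacement) == affinity)
```

On this reading both oligonucleotides are 26 nucleotides long and pair along their full length, including the 11 deoxyribonucleotides that have no partner in the telomerase RNA.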

The protein concentrations of the extract and material obtained following
Affi-Gel-heparin column chromatography were determined using the method of Bradford
(Bradford, Anal. Biochem., 72:248 [1976]), using BSA as the standard. Only a fraction of
the telomerase preparation was further purified on a glycerol gradient.

The sedimentation coefficient of telomerase was determined by glycerol 
gradient centrifugation, as described in Part I. 

Table 5 below is a purification table for telomerase purified according to the
methods of this Example. The telomerase was enriched 12-fold in nuclear extracts, as
compared to whole cell extracts, with a recovery of 80%; 85% of telomerase was solubilized
from nuclei upon extraction.



Table 5. Purification of Telomerase

Fraction            Protein   Telomerase      Telomerase/Protein   Recovery   Purification
                    (mg)      (pmol of RNP)   (pmol of RNP/mg)     (%)        Factor

Nuclear Extract     2020      1720            0.9                  100        1
Heparin             125       1040            8.3                  60         10
Affinity            0.3**     680             2270                 40         2670
Glycerol Gradient   NA*       NA*             NA*                  25         NA*

*NA=Not available 

**This value was calculated from the measured amount of telomerase (680 pmol), by 
assuming a purity of 50% (based on a protein gel). 
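The derived columns of Table 5 follow from the protein and telomerase amounts. The sketch below recomputes them from the three rows that report both values; the function and variable names are illustrative, and small differences from the printed figures reflect rounding in the table.

```python
# (fraction, protein in mg, telomerase in pmol of RNP), taken from Table 5.
ROWS = [
    ("Nuclear Extract", 2020.0, 1720.0),
    ("Heparin", 125.0, 1040.0),
    ("Affinity", 0.3, 680.0),
]

def derive(rows):
    """Return {fraction: (pmol RNP per mg, recovery in %, purification factor)}."""
    _, first_protein, first_rnp = rows[0]
    first_specific = first_rnp / first_protein
    derived = {}
    for name, protein_mg, rnp_pmol in rows:
        specific = rnp_pmol / protein_mg
        recovery = 100.0 * rnp_pmol / first_rnp
        derived[name] = (specific, recovery, specific / first_specific)
    return derived

table = derive(ROWS)
for name, (specific, recovery, fold) in table.items():
    print(f"{name:16s} {specific:8.1f} pmol/mg {recovery:5.0f} % {fold:7.0f}-fold")
```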
E. Telomerase Activity

At each step in the purification of telomerase, the preparation was analyzed by
three separate assays, one of which was activity, as described in this Example. In general,
telomerase assays were done in 40 µl containing 0.003-0.3 µl of nuclear extract, 50 mM
Tris-Cl (pH 7.5), 50 mM KGlu, 10 mM MgCl2, 1 mM DTT, 125 µM dTTP, 125 µM dGTP, and
approximately 0.2 pmoles of 5'-32P-labelled oligonucleotide substrate (i.e., approximately
400,000 cpm). Oligonucleotide primers were heat-denatured prior to their addition to the
reaction mixture. Reactions were assembled on ice and incubated for 30 minutes at 25°C.
The reactions were stopped by addition of 200 µl of 10 mM Tris-Cl (pH 7.5), 15 mM EDTA,
0.6% SDS, and 0.05 mg/ml proteinase K, and incubated for at least 30 minutes at 45°C.
After ethanol precipitation, the products were analyzed on denaturing 8% PAGE gels, as
known in the art (See, e.g., Sambrook et al., 1989).

F. Quantitation of Telomerase Activity

In this Example, quantitation of telomerase activity through the purification
procedure is described. Quantitation was accomplished by assaying the elongation of
oligonucleotide primers in the presence of dGTP and [α-32P]dTTP. Briefly, 1 µM
5'-(G4T4)2-3' oligonucleotide was extended in a 20 µl reaction mixture in the presence of 2 µl of
[α-32P]dTTP (10 mCi/ml, 400 Ci/mmol; 1 Ci=37 GBq), and 125 µM dGTP as described
(Lingner et al., Genes Develop., 8:1984 [1994]) and loaded onto an 8% PAGE sequencing gel
as described.

The results of this study are shown in Figure 28. In lane 1, there is no
telomerase present (i.e., a negative control); lanes 2, 5, 8, and 11 contained 0.14 fmol
telomerase; lanes 3, 6, 9, and 12 contained 0.42 fmol telomerase; and lanes 4, 7, 10, and 13
contained 1.3 fmol telomerase. Activity was quantitated using a PhosphorImager (Molecular
Dynamics) according to the manufacturer's instructions. It was determined that under these
conditions, 1 fmol of affinity-purified telomerase incorporated 21 fmol of dTTP in 30
minutes.

As shown in Figure 28, the specific activity of the telomerase did not change 
significantly through the purification procedure. Affinity-purified telomerase was fully 
active. However, it was determined that at high concentrations, an inhibitory activity was 
detected and the activity of crude extracts was not linear. Thus, in the assay shown in Figure 
28, the crude extract was diluted 700-7000-fold. Upon purification, this inhibitory activity 
was removed and no inhibitory effect was detected in the purified telomerase preparations, 
even at high enzyme concentrations. 

G. Gel Electrophoresis and Northern Blots 

As stated in Part E, at each step in the purification of telomerase, the 
preparation was analyzed by three separate assays. This Example describes the gel 
electrophoresis and blotting procedures used to quantify telomerase RNA present in fractions 
and analyze the integrity of the telomerase ribonucleoprotein particle. 

i) Denaturing Gels and Northern Blots 

In this Example, synthetic T7-transcribed telomerase RNA of known
concentration served as the standard. Throughout this investigation, the RNA component
was used as a measure of telomerase.

A construct for phage T7 RNA polymerase transcription of E. aediculatus
telomerase RNA was produced using PCR. The telomerase RNA gene was amplified with
primers that annealed to either end of the gene. The primer that annealed at the 5' end also
encoded a hammerhead ribozyme sequence to generate the natural 5' end upon cleavage of
the transcribed RNA, a T7-promoter sequence, and an EcoRI site for subcloning. The
sequence of this 5' primer was
5'-GCGGGAATTCTAATACGACTCACTATAGGGAAGAAACTCTGATGAGGCCGAAAGGCCGAAACTCCACGAAAGTGGAGTAAGTTTCTCGATAATTGATCTGTAG-3'.
The 3' primer included an
EarI site for termination of transcription at the natural 3' end, and a BamHI site for cloning.
The sequence of this 3' primer was
5'-CGGGGATCCTCTTCAAAAGATGAGAGGACAGCAAAC-3'. The PCR amplification product was cleaved with EcoRI
and BamHI, and subcloned into the respective sites of pUC19 (NEB), to give "pEaT7." The
correctness of this insert was confirmed by DNA sequencing. T7 transcription was
performed as described by Zaug et al., Biochemistry 33:14935 [1994], with EarI-linearized
plasmid. RNA was gel-purified and the concentration was determined (an A260 of 1 = 40
µg/ml). This RNA was used as a standard to determine the telomerase RNA present in
various preparations of telomerase.
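The functional elements described for the two primers can be located by simple substring search. The sketch below uses the standard recognition sequences for EcoRI (GAATTC), BamHI (GGATCC), and EarI (CTCTTC) and a commonly cited T7 promoter core; these sequences are supplied here as assumptions, since the text names the elements without spelling them out.

```python
# Recognition sequences and promoter core are assumptions (standard values);
# the text names these elements but does not give their sequences.
ELEMENTS = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "EarI": "CTCTTC",
    "T7 promoter core": "TAATACGACTCACTATAG",
}

FIVE_PRIME_PRIMER = (
    "GCGGGAATTCTAATACGACTCACTATAGGGAAGAAACTCTGATGAGGCCGAAAGGCCGAAACTCCAC"
    "GAAAGTGGAGTAAGTTTCTCGATAATTGATCTGTAG"
)
THREE_PRIME_PRIMER = "CGGGGATCCTCTTCAAAAGATGAGAGGACAGCAAAC"

def locate(primer, names):
    """Return {element name: 0-based start index, or -1 if absent}."""
    return {name: primer.find(ELEMENTS[name]) for name in names}

five_hits = locate(FIVE_PRIME_PRIMER, ["EcoRI", "T7 promoter core"])
three_hits = locate(THREE_PRIME_PRIMER, ["BamHI", "EarI"])
print(five_hits)
print(three_hits)
```

In the 3' primer the BamHI and EarI sites overlap, which is why the search reports adjacent start positions for the two.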

The signal of hybridization was proportional to the amount of telomerase 
RNA, and the derived RNA concentrations were consistent with, but slightly higher than 
those obtained by native gel electrophoresis. Comparison of the amount of whole telomerase 
RNA in whole cell RNA to serial dilutions of known T7 RNA transcript concentrations 
indicated that each E. aediculatus cell contained approximately 300,000 telomerase 
molecules. 
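The per-cell estimate can be related to the molar amounts used elsewhere in this Example. The sketch below converts approximately 4 x 10^9 cells at approximately 300,000 molecules per cell into picomoles; pairing these two figures is an illustration only, since the text does not combine them itself.

```python
AVOGADRO = 6.02214076e23  # molecules per mole

def total_pmol(cells, molecules_per_cell):
    """Convert a cell count and a per-cell copy number into picomoles."""
    return cells * molecules_per_cell / AVOGADRO * 1e12

# Both inputs are the approximate figures quoted in the text.
estimate_pmol = total_pmol(4e9, 3e5)
print(f"{estimate_pmol:.0f} pmol")
```

The result, on the order of 2000 pmol, is of the same magnitude as the 1720 pmol of RNP reported for the nuclear extract in Table 5.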

Visualization of the telomerase was accomplished by Northern blot
hybridization to its RNA component, using methods as described (Lingner et al., Genes
Develop., 8:1984 [1994]). Briefly, RNA (less than or equal to 0.5 µg/lane) was resolved on
an 8% PAGE and electroblotted onto a Hybond-N membrane (Amersham), as known in the
art (see, e.g., Sambrook et al., 1989). The blot was hybridized overnight in 10 ml of 4x SSC,
10x Denhardt's solution, 0.1% SDS, and 50 µg/ml denatured herring sperm DNA. After
pre-hybridizing for 3 hours, 2 x 10^6 cpm probe/ml hybridization solution was added. The
randomly labelled probe was a PCR-product that covered the entire telomerase RNA gene.
The blot was washed with several buffer changes for 30 minutes in 2x SSC, 0.1% SDS, and
then washed for 1 hour in 0.1x SSC and 0.1% SDS at 45°C.

ii) Native Gels and Northern Blots

In this experiment, the purified telomerase preparation was run on native (i.e., 
non-denaturing) gels of 3.5% polyacrylamide and 0.33% agarose, as known in the art and 
described (Lamond and Sproat, [1994], supra). The telomerase comigrated approximately 
with the xylene cyanol dye. 

The native gel results indicated that telomerase was maintained as an RNP
throughout the purification protocol. Figure 27 is a photograph of a Northern blot showing
the mobility of the telomerase in different fractions on a non-denaturing gel as well as in vitro
transcribed telomerase. In this figure, lane 1 contained 1.5 fmol telomerase RNA, lane 2
contained 4.6 fmol telomerase RNA, lane 3 contained 14 fmol telomerase RNA, lane 4
contained 41 fmol telomerase RNA, lane 5 contained nuclear extract (42 fmol telomerase),
lane 6 contained Affi-Gel-heparin-purified telomerase (47 fmol telomerase), lane 7 contained
affinity-purified telomerase (68 fmol), and lane 8 contained glycerol gradient-purified
telomerase (35 fmol).

As shown in Figure 27, in nuclear extracts, the telomerase was assembled into
an RNP particle that migrated slower than unassembled telomerase RNA. Less than 1% free
RNA was detected by this method. However, a slower migrating telomerase RNP complex
was also sometimes detected in extracts. Upon purification on the Affi-Gel-heparin column,
the telomerase RNP particle did not change in mobility (Figure 27, lane 6). However, upon
affinity purification the mobility of the RNP particle slightly increased (Figure 27, lane 7),
perhaps indicating that a protein subunit or fragment had been lost. On glycerol gradients,
the affinity-purified telomerase did not change in size, but approximately 2% free telomerase
RNA was detectable (Figure 27, lane 8), suggesting that a small amount of disassembly of the
RNP particle had occurred.

H. Telomerase Protein Composition

In this Example, the analysis of the purified telomerase protein composition
is described.

Glycerol gradient fractions obtained as described in Part D, were separated on 
a 4-20% polyacrylamide gel (Novex). Following electrophoresis, the gel was stained with 
Coomassie brilliant blue. Figure 29 shows a photograph of the gel. Lanes 1 and 2 contained 
molecular mass markers (Pharmacia) as indicated on the left side of the gel shown in Figure 
29. Lanes 3-5 contained glycerol gradient fraction pools as indicated on the top of the gel 
(i.e., lane 3 contained fractions 9-14, lane 4 contained fractions 15-22, and lane 5 contained 
fractions 23-32). Lane 4 contained the pool with 1 pmol of telomerase RNA. In lanes 6-9 
BSA standards were run at concentrations indicated at the top of the gel in Figure 29 (i.e., 
lane 6 contained 0.5 pmol BSA, lane 7 contained 1.5 pmol BSA, lane 8 contained 4.5 pmol BSA,
and lane 9 contained 15 pmol BSA).

As shown in Figure 29, polypeptides with molecular masses of 120 and 43 
kDa co-purified with the telomerase. The 43 kDa polypeptide was observed as a doublet. It 
was noted that the polypeptide of approximately 43 kDa in lane 3 migrated differently than 
the doublet in lane 4; it may be an unrelated protein. The 120 kDa and 43 kDa doublet each 
stained with Coomassie brilliant blue at approximately the level of 1 pmol, when compared 
with BSA standards. Because this fraction contained 1 pmol of telomerase RNA, all of which 
was assembled into an RNP particle (See, Figure 27, lane 8), there appear to be two 
polypeptide subunits that are stoichiometric with the telomerase RNA. However, it is also 
possible that the two proteins around 43 kDa are separate enzyme subunits. 

Affinity-purified telomerase that was not subjected to fractionation on a 
glycerol gradient contained additional polypeptides with apparent molecular masses of 35 and 
37 kDa, respectively. This latter fraction was estimated to be at least 50% pure. However, 
the 35 kDa and 37 kDa polypeptides that were present in the affinity-purified material were 
not reproducibly separated by glycerol gradient centrifugation. These polypeptides may be 
contaminants, as they were not visible in all activity-containing preparations. 

I. Sedimentation Coefficient 

The sedimentation coefficient for telomerase was determined by glycerol
gradient centrifugation. In this Example, nuclear extract and affinity-purified telomerase
were fractionated on 15-40% glycerol gradients containing 20 mM Tris-acetate, with 1 mM
MgCl2, 0.1 mM EDTA, 300 mM KGlu, and 1 mM DTT, at pH 7.5. Glycerol gradients were
poured in 5 ml (13 x 51 mm) tubes, and centrifuged using an SW55Ti rotor (Beckman) at
55,000 rpm for 14 hours at 4°C.

Marker proteins were run in a parallel gradient and had a sedimentation
coefficient of 7.6 S for alcohol dehydrogenase (ADH), 11.3 S for catalase, 17.3 S for
apoferritin, and 19.3 S for thyroglobulin. The telomerase peak was identified by native gel
electrophoresis of gradient fractions followed by blot hybridization to its RNA component.

Figure 30 is a graph showing the sedimentation coefficient for telomerase. As
shown in this Figure, affinity-purified telomerase co-sedimented with catalase at 11.5 S,
while telomerase in nuclear extracts sedimented slightly faster, peaking around 12.5 S.
Therefore, consistent with the mobility of the enzyme in native gels, purified telomerase
appears to have lost a proteolytic fragment or a loosely associated subunit.

The calculated molecular mass for telomerase, if it is assumed to consist of 
one 120 kDa protein subunit, one 43 kDa subunit, and one RNA subunit of 66 kDa, adds up 
to a total of 229 kDa. This is in close agreement with the 232 kDa molecular mass of 
catalase. However, the sedimentation coefficient is a function of the molecular mass, as well 
as the partial specific volume and the frictional coefficient of the molecule, both of which are
unknown for the Euplotes telomerase RNP. 
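The mass arithmetic in this paragraph can be written out explicitly. The sketch below sums the three assumed subunits and, as a further illustration, converts the 680 pmol of affinity-purified telomerase in Table 5 into milligrams at 50% purity. Using the full RNP mass (protein plus RNA) for that conversion is an assumption made here; the text does not say how the 0.3 mg figure was derived.

```python
# Subunit masses in kDa, as assumed in the text (one copy of each).
SUBUNITS_KDA = {"120 kDa protein": 120, "43 kDa protein": 43, "RNA": 66}
CATALASE_KDA = 232

rnp_kda = sum(SUBUNITS_KDA.values())

def mass_mg(pmol, kda):
    """Mass in mg of `pmol` picomoles of a particle of `kda` kilodaltons."""
    grams = pmol * 1e-12 * kda * 1e3
    return grams * 1e3

# Assumption: total protein = RNP mass divided by the estimated purity (50%).
affinity_total_mg = mass_mg(680, rnp_kda) / 0.5

print(rnp_kda, round(rnp_kda / CATALASE_KDA, 2))
print(round(affinity_total_mg, 2))
```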

J. Substrate Utilization 

In this Example, the substrate requirements of Euplotes telomerase were 
investigated. One simple model for DNA end replication predicts that after semi- 
conservative DNA replication, telomerase extends double-stranded, blunt-ended DNA 
molecules. In a variation of this model, a single-stranded 3' end is created by a helicase or
nuclease after replication. This 3' end is then used by telomerase for binding and extension. 

To determine whether telomerase is capable of elongating blunt-ended
molecules, model hairpins were synthesized with telomeric repeats positioned at their 3' ends.
These primer substrates were gel-purified, 5'-end labelled with polynucleotide kinase, heated
at 0.4 µM to 80°C for 5 minutes, and then slowly cooled to room temperature in a heating
block, to allow renaturation and helix formation of the hairpins. Substrate mobility on a
non-denaturing gel indicated that very efficient hairpin formation was present, as compared to
dimerization.

Assays were performed with unlabelled 125 µM dGTP, 125 µM dTTP, and
0.02 µM 5'-end-labelled primer (5'-32P-labelled oligonucleotide substrate) in 10 µl reaction
mixtures that contained 20 mM Tris-acetate, with 10 mM MgCl2, 50 mM KGlu, and 1 mM
DTT, at pH 7.5. These mixtures were incubated at 25°C for 30 minutes. Reactions were
stopped by adding formamide loading buffer (i.e., TBE, formamide, bromthymol blue, and
cyanol, Sambrook, 1989, supra).

Primers were incubated without telomerase ("-"), with 5.9 fmol of
affinity-purified telomerase ("+"), or with 17.6 fmol of affinity-purified telomerase ("+++").
Affinity-purified telomerase used in this assay was dialyzed with a membrane having a molecular
cut-off of 100 kDa, in order to remove the displacement oligonucleotide. Reaction products were
separated on an 8% PAGE/urea gel containing 36% formamide, to denature the hairpins. The
sequences of the primers used in this study, as well as their lane assignments are shown in
Table 6.

TABLE 6. Primer Sequences

Lane     Primer Sequence (5' to 3')

1-3      C4(A4C4)3CACA(G4T4)3G4
4-6      C2(A4C4)3CACA(G4T4)3G4
7-9      (A4C4)3CACA(G4T4)3G4
10-12    A2C4(A4C4)2CACA(G4T4)3G4
13-15    C4(A4C4)2CACA(G4T4)3G4
16-18    (A4C4)3CACA(G4T4)3
19-21    A2C4(A4C4)2CACA(G4T4)3
22-24    C4(A4C4)2CACA(G4T4)3
25-27    C2(A4C4)2CACA(G4T4)3
28-30    (A4C4)2CACA(G4T4)3
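The shorthand in Table 6 can be expanded mechanically, which makes the 3' overhang of each hairpin explicit. The sketch below is a hypothetical helper, not part of the described method: it assumes the CACA tetranucleotide forms the loop and that the 5' arm pairs fully with the start of the 3' arm, and it is applied only to entries that are clearly legible in the table.

```python
import re

COMPLEMENT = str.maketrans("ACGT", "TGCA")

def expand(shorthand):
    """Expand shorthand such as 'C2(A4C4)2' into 'CCAAAACCCCAAAACCCC'."""
    def flat(run):
        return "".join(base * int(count or 1)
                       for base, count in re.findall(r"([ACGT])(\d*)", run))
    parts = re.findall(r"\(([^()]+)\)(\d+)|([^()]+)", shorthand)
    return "".join(flat(group) * int(reps) if group else flat(plain)
                   for group, reps, plain in parts)

def overhang(shorthand, loop="CACA"):
    """Length of the single-stranded 3' end, assuming `loop` is unpaired."""
    five_arm, _, three_arm = shorthand.partition(loop)
    five_seq, three_seq = expand(five_arm), expand(three_arm)
    paired = five_seq.translate(COMPLEMENT)[::-1]
    if not three_seq.startswith(paired):
        raise ValueError("arms are not fully complementary")
    return len(three_seq) - len(five_seq)

for lanes, primer in [("1-3", "C4(A4C4)3CACA(G4T4)3G4"),
                      ("4-6", "C2(A4C4)3CACA(G4T4)3G4"),
                      ("25-27", "C2(A4C4)2CACA(G4T4)3"),
                      ("28-30", "(A4C4)2CACA(G4T4)3")]:
    print(lanes, len(expand(primer)), overhang(primer))
```

Under these assumptions the lane 1-3 substrate is blunt-ended and the lane 25-27 substrate carries a six-base overhang, in line with the way those lanes are described in the text.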



The gel results are shown in Figure 31. Lanes 1-15 contained substrates with
telomeric repeats ending with four G residues. Lanes 16-30 contained substrates with
telomeric repeats ending with four T residues. The putative alignment on the telomerase
RNA template is indicated in Figure 32. It was assumed that the primer sets anneal at two
very different positions in the template shown in Figure 32 (i.e., Panel A and Panel B,
respectively). This may have affected their binding and/or elongation rate.

Figure 33 shows a lighter exposure of lanes 25-30 in Figure 31. The lighter 
exposure of Figure 33 was taken to permit visualization of the nucleotides that are added and 
the positions of pausing in elongated products. Percent of substrate elongated for the third 
lane in each set was quantified on a PhosphorImager, as indicated on the bottom of Figure 31.

The substrate efficiencies for these hairpins were compared with double- 
stranded telomere-like substrates with overhangs of differing lengths. A model substrate that 
ended with four G residues (see lanes 1-15 of Figure 31) was not elongated when it was blunt
ended (see lanes 1-3). However, slight extension was observed with an overhang length of 
two bases; elongation became efficient when the overhang was at least 4 bases in length. The 
telomerase acted in a similar manner with a double-stranded substrate that ended with four T 
residues, with a 6-base overhang required for highly efficient elongation. In Figure 31, the 
faint bands below the primers in lanes 10-15 that are independent of telomerase represent 
shorter oligonucleotides in the primer preparations. 

The lighter exposure of lanes 25-30 in Figure 33 shows a ladder of elongated 
products, with the darkest bands correlating with the putative 5' boundary of the template (as 
described by Lingner et al, Genes Develop., 8:1984 [1994]). The abundance of products that 
correspond to other positions in the template suggested that pausing and/or dissociation 
occurs at sites other than the site of translocation with the purified telomerase. 

As shown in Figure 31, double-stranded, blunt-ended oligonucleotides were 
not substrates for telomerase. To determine whether these molecules would bind to 
telomerase, a competition experiment was performed. In this experiment, 2 nM of 5'-end
labeled substrate with the sequence (G4T4)2, or a hairpin substrate with a six base overhang
were extended with 0.125 nM telomerase (Figure 31, lanes 25-27). Although the same
unlabeled oligonucleotide substrates competed efficiently with labeled substrate for 
extension, no reduction of activity was observed when the double-stranded blunt-ended 
hairpin oligonucleotides were used as competitors, even in the presence of 100-fold excess 
hairpins. 

These results indicated that double-stranded, blunt-ended oligonucleotides
cannot bind to telomerase at the concentrations and conditions tested in this Example.
Rather, a single-stranded 3' end is required for binding. It is likely that this 3' end is required
to base pair with the telomerase RNA template.

K. Cloning & Sequencing of the 123 kDa Polypeptide 

In this Example, the cloning of the 123 kDa polypeptide of Euplotes
telomerase (i.e., the 123 kDa protein subunit) is described. In this study, an internal fragment
of the telomerase gene was amplified by PCR, with oligonucleotide primers designed to
match peptide sequences that were obtained from the purified polypeptide obtained in Part D,
above. The polypeptide sequence was determined using the nanoES tandem mass
spectroscopy methods known in the art and described by Calvio et al., RNA 1:724-733
[1995]. The oligonucleotide primers used in this Example had the following sequences, with
positions that were degenerate shown in parentheses: 5'-TCT(G/A)
AA(G/A)TA(G/A)TG^ and
5'-GCGGATCCATGAA(T/C^

A 50 µl reaction contained 0.2 mM dNTPs, 0.15 µg E. aediculatus
chromosomal DNA, 0.5 µl Taq (Boehringer-Mannheim), 0.8 µg of each primer, and 1x
reaction buffer (Boehringer-Mannheim). The reaction was incubated in a thermocycler
(Perkin-Elmer), using the following program: 5 minutes at 95°C, followed by 30 cycles of 1 minute at
94°C, 1 minute at 52°C, and 2 minutes at 72°C. The reaction was completed by a 10 minute
incubation at 72°C.

A genomic DNA library was prepared from the chromosomal E. aediculatus
DNA by cloning blunt-ended DNA into the SmaI site of pCR-Script plasmid vector
(Stratagene). This library was screened by colony hybridization, with the
radiolabeled, gel-purified PCR product. Plasmid DNA of positive clones was prepared and
sequenced by the dideoxy method (Sanger et al., Proc. Natl. Acad. Sci., 74:5463 [1977]), either
manually or through use of an automated sequencer (ABI). The DNA sequence of the gene
encoding this polypeptide is shown in Figure 13. The start codon in this sequence, inferred
from the DNA sequence, is located at nucleotide position 101, and the open reading frame
ends at position 3193. The genetic code of Euplotes differs from other organisms in that the
"UGA" codon encodes a cysteine residue. The amino acid sequence of the polypeptide
inferred from the DNA sequence is shown in Figure 14, and assumes that no unusual amino
acids are inserted during translation and no post-translational modification occurs.
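The effect of the Euplotes reassignment of UGA on conceptual translation can be illustrated with a small codon table. The sketch below builds the standard table in the conventional TCAG ordering and overrides the single codon TGA; the test sequence is invented for illustration and is not taken from Figure 13.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
STANDARD = dict(zip(("".join(c) for c in product(BASES, repeat=3)), STANDARD_AA))

# Euplotes: TGA (UGA in the mRNA) encodes cysteine instead of stop.
EUPLOTES = {**STANDARD, "TGA": "C"}

def translate(dna, table):
    """Translate from the first base until a stop codon or the end of `dna`."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = table[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

example = "ATGTGATAA"  # hypothetical sequence: Met, TGA, stop
print(translate(example, STANDARD))
print(translate(example, EUPLOTES))
```

With the standard table the in-frame TGA terminates translation, whereas with the Euplotes table it is read through as cysteine, which is why the open reading frame must be inferred with the organism's own code.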

L. Cloning & Sequencing of the 43 kDa Polypeptide 

In this Example, the cloning of the 43 kDa polypeptide of telomerase (i.e., the
43 kDa protein subunit) is described. In this study, an internal fragment of the corresponding
telomerase gene was amplified by PCR, with oligonucleotide primers designed to match
peptide sequences that were obtained from the purified polypeptide obtained in Part D, above.
The polypeptide sequence was determined using the nanoES tandem mass spectroscopy
methods known in the art and described by Calvio et al., supra. The oligonucleotide primers
used in this Example had the following sequences:
5'-NNNGTNAC(C/T/A)GG(C/T/A)AT(C/T/A)AA(C/T)AA-3', and
5'-(T/G/A)GC(T/G/A)GT(C/T)TC(T/C)TG(G/A)TC(G/A)TT(G/A)TA-3'. In this sequence, "N" indicates
the presence of any of the four nucleotides (i.e., A, T, G, or C).
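The number of distinct sequences represented by each degenerate primer follows from multiplying the choices at each position. The sketch below counts them for the two primers given above; the function name is illustrative.

```python
import re
from math import prod

def degeneracy(primer):
    """Count the distinct sequences encoded by a degenerate primer.

    Positions in parentheses list their alternatives separated by '/',
    and 'N' stands for any of the four nucleotides.
    """
    tokens = re.findall(r"\(([ACGT/]+)\)|([ACGTN])", primer)
    return prod(len(choice.split("/")) if choice else (4 if base == "N" else 1)
                for choice, base in tokens)

forward = "NNNGTNAC(C/T/A)GG(C/T/A)AT(C/T/A)AA(C/T)AA"
reverse = "(T/G/A)GC(T/G/A)GT(C/T)TC(T/C)TG(G/A)TC(G/A)TT(G/A)TA"
print(degeneracy(forward), degeneracy(reverse))
```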

The PCR was performed as described in Part K.

A genomic DNA library was prepared and screened as described in Part K. 
The DNA sequence of the gene encoding this polypeptide is shown in Figure 34. Three 
potential reading frames are shown for this sequence, as shown in Figure 35. For clarity, the 
amino acid sequence is indicated below the nucleotide sequence in all three reading frames. 

These reading frames are designated as "a," "b," and "c". A possible start codon is encoded at
nucleotide position 84 in reading frame "c." The coding region could end at position 1501 in 
reading frame "b." Early stop codons, indicated by asterisks in this figure, occur in all three 
reading frames between nucleotide position 337-350. 

The "La-domain" is indicated in bold-face type. Further downstream, the
protein sequence appears to be encoded by different reading frames, as none of the three
frames is uninterrupted by stop codons. Furthermore, peptide sequences from purified
protein are encoded in all three frames. Therefore, this gene appears to contain intervening
sequences, or in the alternative, the RNA is edited. Other possibilities include ribosomal
frame-shifting or sequence errors. However, the homology to the La-protein sequence
remains of significant interest. Again, in Euplotes, the "UGA" codon encodes a cysteine
residue.


M. Amino Acid and Nucleic Acid Comparisons 

In this Example, comparisons between various reported sequences and the 
sequences of the 123 kDa and 43 kDa telomerase subunit polypeptides were made. 

i) Comparisons with the 123 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide 
was compared with the sequence of the 80 kDa telomerase protein subunit of Tetrahymena 
thermophila (GenBank accession #U25641) to investigate their similarity. The nucleotide 
sequence as obtained from GenBank encoding this protein is shown in Figure 42. The amino 
acid sequence of this protein as obtained from GenBank is shown in Figure 43. The sequence 
comparison between the 123 kDa E. aediculatus and 80 kDa T. thermophila is shown in 
Figure 36. In this figure, the E. aediculatus sequence is the upper sequence, while the T. 
thermophila sequence is the lower sequence. The observed identity was determined to be 
approximately 19%, while the percent similarity was approximately 45%, values similar to 
what would be observed with any random protein sequence. In Figures 36-39, identities are 
indicated by vertical bars, while single dots between the sequences indicate somewhat similar 
amino acids, and double dots between the sequences indicate more similar amino acids. 
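
The percent identity figures quoted here can be illustrated with a small sketch. The source does not state which alignment program or denominator was used, so the convention below (identical residues divided by the number of aligned, non-gap columns) is an assumption, and the two short sequences are invented toy data rather than telomerase sequences.

```python
def percent_identity(upper: str, lower: str) -> float:
    """Percent identity over columns where neither sequence has a gap ('-')."""
    if len(upper) != len(lower):
        raise ValueError("aligned sequences must have equal length")
    # Keep only the columns in which both sequences carry a residue.
    aligned = [(a, b) for a, b in zip(upper, lower) if a != "-" and b != "-"]
    if not aligned:
        return 0.0
    matches = sum(a == b for a, b in aligned)
    return 100.0 * matches / len(aligned)

if __name__ == "__main__":
    # Hypothetical toy alignment: upper and lower rows, as in the figures.
    print(round(percent_identity("MKV-LSTA", "MRVALS-A"), 1))
```

Other conventions (for example, dividing by the full alignment length or by the shorter sequence) give different values, which is one reason identity percentages from different tools are not directly comparable.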

The amino acid sequence of the 123 kDa Euplotes aediculatus polypeptide was also 
compared with the sequence of the 95 kDa telomerase protein subunit of Tetrahymena 
thermophila (GenBank accession #U25642), to investigate their similarity. The nucleotide 
sequence as obtained from GenBank encoding this protein is shown in Figure 44. The amino 
acid sequence of this protein as obtained from GenBank is shown in Figure 45. This 
sequence comparison is shown in Figure 37. In this figure, the E. aediculatus sequence is the 
upper sequence, while the T. thermophila sequence is the lower sequence. The observed
identity was determined to be approximately 20%, while the percent similarity was 
approximately 43%, values similar to what would be observed with any random protein 
sequence. 

Significantly, the amino acid sequence of the 123 kDa E. aediculatus
polypeptide contains the five motifs characteristic of reverse transcriptases. The 123 kDa
polypeptide was also compared with the polymerase domains of various reverse
transcriptases. Figure 40 shows the alignment of the 123 kDa polypeptide with the putative
yeast homolog (L8543.12 or ESTp). The amino acid sequence of L8543.12 obtained from
GenBank is shown in Figure 46.

Four motifs (A, B, C, and D) were included in this comparison. In Figure
40, highly conserved residues are indicated by white letters on a black background. Residues
of the E. aediculatus sequences that are conserved in the other sequence are indicated in bold;
the "h" indicates the presence of a hydrophobic amino acid. The numerals located between
amino acid residues of the motifs indicate the length of gaps in the sequences. For example,
the "100" shown between motifs A and B reflects a 100 amino acid gap in the sequence
between the motifs.

As noted above, GenBank searches identified a yeast protein (GenBank
accession #U20618), and gene L8543.12 (Est2) containing or encoding amino acid sequence
that shows some homology to the E. aediculatus 123 kDa telomerase subunit. Based on the
observations that both proteins contain reverse transcriptase motifs in their C-terminal
regions; both proteins share similarity in regions outside the reverse transcriptase motif; the
proteins are similarly basic (pI = 10.1 for E. aediculatus and pI = 10.0 for the yeast); and both
proteins are large (123 kDa for E. aediculatus and 103 kDa for the yeast), these sequences
comprise the catalytic core of their respective telomerases. It was contemplated, based on this
observation of homology in two organisms as phylogenetically distinct as E. aediculatus and
yeast, that human telomerase would contain a protein that has the same characteristics (i.e.,
reverse transcriptase motifs, is basic, and large [> 100 kDa]).

ii) Comparisons with the 43 kDa E. aediculatus Telomerase Subunit 

The amino acid sequence of the "La-domain" of the 43 kDa Euplotes 
aediculatus polypeptide was compared with the sequence of the 95 kDa telomerase protein 
subunit of Tetrahymena thermophila (described above) to investigate their similarity. This 
sequence comparison is shown in Figure 38. In this figure, the E. aediculatus sequence is the
upper sequence, while the T. thermophila sequence is the lower
sequence. The observed identity was determined to be approximately 23%, while the percent 
similarity was approximately 46%, values similar to what would be observed with any 
random protein sequence. 

The amino acid sequence of the "La-domain" of the 43 kDa Euplotes
aediculatus polypeptide was compared with the sequence of the 80 kDa telomerase protein
subunit of Tetrahymena thermophila (described above) to investigate their similarity. This
sequence comparison is shown in Figure 39. In this figure, the E. aediculatus sequence is the 
upper sequence, while the T. thermophila sequence is the lower sequence. The observed 
identity was determined to be approximately 26%, while the percent similarity was 
approximately 49%, values similar to what would be observed with any random protein 
sequence. 

The amino acid sequence of a domain of the 43 kDa E. aediculatus 
polypeptide was also compared with La proteins from various other organisms. These 
comparisons are shown in Figure 41. In this Figure, highly conserved residues are indicated 
by white letters on a black background. Residues of the E. aediculatus sequences that are
conserved in the other sequence are indicated in bold. 

N. Identification of Telomerase Protein Subunits in Another Organism 

In this Example, the sequences identified in the previous Examples above were 
used to identify the telomerase protein subunits of Oxytricha trifallax, a ciliate that is very 
distantly related to E. aediculatus. Primers were chosen based on the conserved region of the 
E. aediculatus 123 kDa polypeptide which comprised the reverse transcriptase domain 
motifs. Suitable primers were synthesized and used in a PCR reaction with total DNA from 
Oxytricha. The Oxytricha DNA was prepared according to methods known in the art. The 
PCR products were then cloned and sequenced using methods known in the art. 

The oligonucleotide sequences used as the primers were as follows: 
5'-(T/C)A(A/G)A^ 
GG-3' and 5HG/A/T)GT(G/A/T)ATO^ 

Positions that were degenerate are shown in parentheses, with the alternative bases shown
within the parentheses. "N" represents any of the four nucleotides.

In the PCR reaction, a 50 µl reaction contained 0.2 mM dNTPs, 0.3 µg
Oxytricha trifallax chromosomal DNA, 1 µl Taq polymerase (Boehringer-Mannheim), 2
micromolar of each primer, and 1x reaction buffer (Boehringer-Mannheim). The reaction was
incubated in a thermocycler (Perkin-Elmer) under the following conditions: 5 min at 95°C,
30 cycles consisting of 1 min at 94°C, 1 min at 53°C, and 1 min at 72°C, followed by a 10
min incubation at 72°C. The PCR product was gel-purified and sequenced by the
dideoxy method (e.g., Sanger et al., Proc. Natl. Acad. Sci. 74, 5463-5467 (1977)).

The deduced amino acid sequence of the PCR product was determined and 
compared with the E. aediculatus sequence. Figure 47 shows the alignment of these 
sequences, with the O. trifallax sequence shown in the top row, and the E. aediculatus 
sequence shown in the bottom row. As can be seen from this figure, there is a great deal of 
homology between the O. trifallax polypeptide sequence identified in this Example with the 
E. aediculatus polypeptide sequence. Thus, it is clear that the sequences identified in the 
present invention are useful for the identification of homologous telomerase protein subunits 
in other eukaryotic organisms. Indeed, development of the present invention has identified 
homologous telomerase sequences in multiple, diverse species, as described herein. 

O. Identification of Tetrahymena Telomerase Sequences 

In this Example, a Tetrahymena clone was produced that shares homology 
with the Euplotes sequences, and EST2p. 

This experiment utilized PCR with degenerate oligonucleotide primers 
directed against conserved motifs to identify regions of homology between Tetrahymena, 
Euplotes, and EST2p sequences. The PCR method used in this Example is a novel method 
designed to amplify specifically rare DNA sequences from complex mixtures. This method 
avoids the problem of amplification of DNA products with the same PCR primer at both ends 
(i.e., single primer products) commonly encountered in PCR cloning methods. These single 
primer products produce unwanted background and can often obscure the amplification and 
detection of the desired two-primer product. The method used in this experiment 
preferentially selects for two-primer products. In particular, one primer is biotinylated and 
the other is not. After several rounds of PCR amplification, the products are purified using 
streptavidin magnetic beads and two primer products are specifically eluted using heat 
denaturation. This method finds use in settings other than the experiments described in this 
Example. Indeed, this method finds use in applications in which it is desired to specifically
amplify rare DNA sequences, including the preliminary steps in cloning methods such as 5'
and 3' RACE, and any method that uses degenerate primers in PCR.
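
The selection logic described in the preceding paragraph can be sketched as a toy model. This is only an illustration under simplifying assumptions (every duplex carries one primer at the 5' end of each strand, binding to streptavidin is complete, and heat releases only strands that carry no biotin); the class and function names are hypothetical, while the primer names K231 (biotinylated) and K220 are those used in this Example.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Duplex:
    """A double-stranded PCR product, described by the primer on each strand."""
    top_primer: str
    bottom_primer: str

def eluted_strands(product: Duplex, biotinylated: set[str]) -> list[str]:
    """Strands recovered in the eluate after bead capture and heat denaturation."""
    top_bio = product.top_primer in biotinylated
    bottom_bio = product.bottom_primer in biotinylated
    if not (top_bio or bottom_bio):
        return []  # no biotin: never captured on the beads, washed away
    eluted = []
    if not top_bio:
        eluted.append("top")      # released from its biotinylated partner by heat
    if not bottom_bio:
        eluted.append("bottom")
    return eluted                 # empty when both strands stay bound via biotin

if __name__ == "__main__":
    biotin = {"K231"}
    for d in (Duplex("K231", "K220"), Duplex("K231", "K231"), Duplex("K220", "K220")):
        print(d, "->", eluted_strands(d, biotin))
```

In this model only the two-primer product contributes a strand to the eluate, which is the property the method relies on to suppress single-primer background.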

A first PCR run was conducted using Tetrahymena template macronuclear
DNA isolated using methods known in the art, and the 24-mer forward primer with the
sequence 5'-biotin-GCCTATTT(TC)TT(TC)TA(TC)(GATC)(GATC)(GATC)AC(GATC)GA-3',
designated as "K231," corresponding to the FFYXTE region, and
the 23-mer reverse primer with the sequence
5'-CCAGATAT(GATC)A(TGA)(GATC)A(AG)(AG)AA(AG)TC(AG)TC-3', designated as "K220,"
corresponding to the DDFL(FIL)I region. This PCR reaction contained 2.5 ul DNA (50 ng), 4 ul of each
primer (20 uM), 3 ul 10x PCR buffer, 3 ul 10x dNTPs, 2 ul Mg, 0.3 ul Taq, and 11.2 ul
dH2O. The mixture was cycled for 8 cycles of 94°C for 45 seconds, 37°C for 45 seconds,
and 72°C for 1 minute.
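
As a quick consistency check on the reaction mix just described, the following sketch totals the listed volumes and derives the final primer concentration by simple dilution. The dictionary keys are descriptive labels chosen for illustration; the volumes and the 20 uM primer stock concentration are taken from the text above.

```python
# Component volumes in microliters, as listed for the first PCR run.
FIRST_RUN_UL = {
    "template DNA": 2.5,
    "forward primer K231": 4.0,
    "reverse primer K220": 4.0,
    "10x PCR buffer": 3.0,
    "10x dNTPs": 3.0,
    "Mg": 2.0,
    "Taq": 0.3,
    "water": 11.2,
}

def total_volume(mix: dict[str, float]) -> float:
    """Sum of all component volumes (ul)."""
    return sum(mix.values())

def final_concentration(stock: float, volume_added: float, total: float) -> float:
    """Dilution formula C_final = C_stock * V_added / V_total."""
    return stock * volume_added / total

if __name__ == "__main__":
    total = total_volume(FIRST_RUN_UL)
    primer_um = final_concentration(20.0, FIRST_RUN_UL["forward primer K231"], total)
    buffer_x = final_concentration(10.0, FIRST_RUN_UL["10x PCR buffer"], total)
    print(f"total {total:.1f} ul, primer {primer_um:.2f} uM, buffer {buffer_x:.1f}x")
```

The listed volumes sum to 30 ul, so the 10x buffer and 10x dNTP stocks are each diluted to 1x, and each primer ends up at roughly 2.7 uM.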

This PCR reaction was bound to 200 ul streptavidin magnetic beads, washed
with 200 ul TE, resuspended in 20 ul dH2O and then heat-denatured by boiling at 100°C for
2 minutes. The beads were pulled down and the eluate removed. Then, 2.5 ul of this eluate
was subsequently reamplified using the above conditions, with the exception being that 0.3 ul
of α-32P dATP was included, and the PCR was carried out for 33 cycles. This reaction was
run on a 5% denaturing polyacrylamide gel, and the appropriate region was cut out of the gel.
These products were then reamplified for an additional 34 cycles, under the conditions listed
above, with the exception being that a 42°C annealing temperature was used.

A second PCR run was conducted using Tetrahymena macronuclear DNA
template isolated using methods known in the art, and the 23-mer forward primer with the
sequence 5'-ACAATG(CA)G(GATC)(TCA)T(GATC)(TCA)T(GATC)CC(GATC)AA(AG)AA-3',
designated as "K228," corresponding to the region R(LI)(LI)PKK,
and a reverse primer with the sequence
5'-ACGAATC(GT)(GATC)GG(TAG)AT(GATC)(GC)(TA)(AG)TC(AG)TA(AG)CA-3', designated "K224," corresponding
to the CYDSIPR region. This PCR reaction contained 2.5 ul DNA (50 ng), 4 ul of each
primer (20 uM), 3 ul 10x PCR buffer, 3 ul 10x dNTPs, 2 ul Mg, 0.3 ul α-32P dATP, 0.3 ul
Taq, and 10.9 ul dH2O. This reaction was run on a 5% denaturing polyacrylamide gel, and
the appropriate region was cut out of the gel. These products were reamplified for an
additional 34 cycles, under the conditions listed above, with the exception being that a 42°C
annealing temperature was used.

Ten ul of the reaction product from run 1 were bound to streptavidin-coated
magnetic beads in 200 ul TE. The beads were washed with 200 ul TE, and then resuspended
in 20 ul of dH2O, heat denatured, and the eluate was removed. The reaction product from run
2 was then added to the beads and diluted with 30 ul 0.5x SSC. The mixture was heated from
94°C to 50°C. The eluate was removed and the beads were washed three times in 0.5x SSC
at 55°C. The beads were then resuspended in 20 ul dH2O, heat denatured, and the eluate was
removed, designated as "round 1 eluate" and saved.

To isolate the Tetrahymena band, the round 1 eluate was reamplified with the
forward primer K228 and reverse primer K227 with the sequence
5'-CAATTCTC(AG)TA(AG)CA(GATC)(CG)(TA)(CT)TT(AGT)AT(GA)TC-3',
corresponding to the DIKSCYD region. The PCR reactions were conducted as described
above. The reaction products were run on a 5% polyacrylamide gel; the band corresponding
to approximately 295 nucleotides was cut from the gel and sequenced.

The clone designated as 168-3 was sequenced. The DNA sequence (including 
the primer sequences) was found to be: 

GATTACTCCCGAAGAAAGGATCTTTCCGTCCAATCATGACTTTCTTAAGAAAGGA
CAAGCAAAAAAATATTAAGTTAAATCTAAATTAAATTCTAATGGATAGCCAACTT
GTGTTTAGGAATTTAAAAGACATGCTGGGATAAAAGATAGGATACTCAGTCTTTG
ATAATAAACAAATTTCAGAAAAATTTGCCTAATTCATAGAGAAATGGAAAAATA
AAGGAAGACCTCAGCTATATTATGTCACTCTAGACATAAAGACTTGCTAC.

Additional sequence of this gene was obtained by PCR using one unique
primer designed to match the sequence from 168-3 ("K297" with the sequence
5'-GAGTGACATAATATACGTGA-3') and the K231 (FFYXTE) primer. The sequence of
the fragment obtained from this reaction, together with 168-3, is as follows (without the
primer sequences):

AAACACAAGGAAGGAAGTCAAATATTCTATTACCGTAAACCAATATGGAAATTA
GTGAGTAAATTAACTATTGTCAAAGTAAGAATTTAGTTTTCTGAAAAGAATAAAT
AAATGAAAAATAATTTTTATCAAAAAATTTAGCTTGAAGAGGAGAATTTGGAAA
AAGTTGAAGAAAAATTGATACCAGAAGATTCATTTTAGAAATACCCTCAAGGAA
AGCTAAGGATTATACCTAAAAAAGGATCTTTCCGTCCAATCATGACTTTCTTAAG
AAAGGACAAGCAAAAAAATATTAAGTTAAATCTAAATTAAATTCTAATGGATAG
CCAACTTGTGTTTAGGAATTTAAAAGACATGCTGGGATAAAAGATAGGATACTCA
GTCTTTGATAATAAACAAATTTCAGAAAAATTTGCCTAATTCATAGAGAAATGGA
AAAATAAAGGAAGACCTCAGCTATATTATGTCACTCTA.

The amino acid sequence corresponding to this DNA fragment was found to
be:

KHKEGSQIFYYRKPIWKLVSKLTIVKVRIQFSEKNKQMKNNFYQKIQLEEENLEKVEE
KLIPEDSFQKYPQGKLRIIPKKGSFRPIMTFLRKDKQKNIKLNLNQILMDSQLVFRNLK
DMLGQKIGYSVFDNKQISEKFAQFIEKWKNKGRPQLYYVTL.
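
The amino acid sequence above reads the codons TAA and TAG as glutamine (Q), which is consistent with the ciliate nuclear genetic code (NCBI translation table 6) rather than the standard code. The sketch below, which is an illustration and not the method used in the original work, translates the first 93 nucleotides of the fragment under both codes to make the difference visible; the helper names are hypothetical.

```python
BASES = "TCAG"
# Standard genetic code in NCBI order (first base slowest, third base fastest).
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def build_table(aa_string: str) -> dict[str, str]:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aa_string))

STANDARD = build_table(STANDARD_AA)
# Ciliate nuclear code: TAA and TAG encode glutamine; TGA remains a stop.
CILIATE = dict(STANDARD, TAA="Q", TAG="Q")

def translate(dna: str, table: dict[str, str]) -> str:
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# First 93 nucleotides of the fragment shown above.
FRAGMENT = (
    "AAACACAAGGAAGGAAGTCAAATATTCTATTACCGTAAACCAATATGGAAATTA"
    "GTGAGTAAATTAACTATTGTCAAAGTAAGAATTTAGTTT"
)

if __name__ == "__main__":
    print("ciliate :", translate(FRAGMENT, CILIATE))
    print("standard:", translate(FRAGMENT, STANDARD))
```

Under the standard code the same stretch is interrupted by an in-frame stop, which is why the choice of translation table matters when searching ciliate sequences for open reading frames.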

This amino acid sequence was then aligned with other telomerase genes 
(EST2p, and Euplotes). The alignment is shown in Figure 53. A consensus sequence is also 
shown in this Figure. 

P. Identification of Schizosaccharomyces pombe Telomerase Sequences

In this Example, the tez1 sequence of S. pombe was identified as a homolog of
the E. aediculatus p123, and S. cerevisiae Est2p.

Figure 55 provides an overall summary of these experiments. In this Figure,
the top portion (Panel A) shows the relationship of two overlapping genomic clones, and the
5825 bp portion that was sequenced. The region designated as "tez1+" is the protein coding
region, with the flanking sequences indicated as well. The box underneath the 5825 bp region
is an approximately 2 kb HindIII fragment that was used to make the tez1 disruption
construct, as described below.

The bottom half of Figure 55 (Panel B) is a "close-up" schematic of this same
region of DNA. The sequence designated as "original PCR" is the original degenerate PCR
fragment that was generated with a degenerate oligonucleotide primer pair designed based on
Euplotes sequence motif 4 (B') and motif 5 (C), as described.

i) PCR With Degenerate Primers 

PCR using degenerate primers was used to find the homolog of the E.
aediculatus p123 in S. pombe. Figure 56 shows the sequences of the degenerate primers
(designated as "poly 4" and "poly 1") used in this reaction. The PCR runs were conducted
using the same buffer as described in previous Examples (See e.g., Part K, above), with a 5
minute ramp time at 94°C, followed by 30 cycles of 94°C for 30 seconds, 50°C for 45
seconds, and 72°C for 30 seconds, and 7 minutes at 72°C, followed by storage at 4°C. PCR
runs were conducted using varied conditions (i.e., various concentrations of S. pombe DNA
and MgCl2 concentrations). The PCR products were run on agarose gels and stained with
ethidium bromide as described above. Several PCR runs resulted in the production of three
bands (designated as "T," "M," and "B"). These bands were re-amplified and run on gels
using the same conditions as described above. Four bands were observed following this
re-amplification ("T," "M1," "M2," and "B"), as shown in Figure 57. These four bands were
then re-amplified using the same conditions as described above. The third band from the top
of the lane in Figure 57 was identified as containing the correct sequence for a telomerase
protein. The PCR product designated as M2 was found to show a reasonable match with
other telomerase proteins, as indicated in Figure 58. In addition to the alignment shown, this
Figure also shows the actual sequence of tez1. In this Figure, the asterisks indicate residues
shared with all four sequences (Oxytricha "Ot"; E. aediculatus "Ea_p123"; S. cerevisiae
"Sc_p103"; and M2), while the circles (i.e., dots) indicate similar amino acid residues.

ii) 3' RT PCR

To obtain additional sequence information, 3' and 5' RT PCR were conducted
on the telomerase candidate identified in Figure 58. Figure 59 provides a schematic of the 3'
RT PCR strategy used. First, cDNA was prepared from mRNA using the oligonucleotide
primer "QT" (5'-CCA GTG AGC AGA GTG ACG AGG ACT CGA GCT CAA GCT TTT
TTT TTT TTT TT-3'). This cDNA was then used as a template for PCR with "QO" (5'-CCA GTG
AGC AGA GTG ACG-3') and a primer designed based on the original degenerate PCR
reaction (i.e., "M2-T" with the sequence 5'-G TGT CAT TTC TAT ATG GAA GAT TTG
ATT GAT G-3'). The second PCR reaction (i.e., nested PCR) was conducted with "QI" (5'-GAG GAC TCG
AGC TCA AGC-3') and another PCR primer designed with sequence derived from the
original degenerate PCR reaction, "M2-T2" (5'-AC CTA TCG TTT ACG AAA AAG AAA
GGA TCA GTG-3'). The buffers used in this PCR were the same as described above, with
amplification conducted beginning with a ramp up of 94°C for 5 min, followed by 30 cycles of
94°C for 30 sec, 55°C for 30 sec, and 72°C for 3 min, followed by 7 minutes at 72°C. The
reaction products were stored at 4°C until use.
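
For planning purposes, the cycling program just described can be written down as data and its nominal hold time totalled. This is a minimal sketch, assuming the times quoted in the text and ignoring the instrument's temperature ramping between steps (which the source does not specify); the `Step` class and function name are hypothetical.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Step:
    temp_c: float   # hold temperature in degrees Celsius
    seconds: int    # hold time in seconds

def program_seconds(initial: Step, cycle: list[Step], n_cycles: int, final: Step) -> int:
    """Total programmed hold time, excluding ramping between temperatures."""
    return initial.seconds + n_cycles * sum(s.seconds for s in cycle) + final.seconds

# 3' RT PCR program as stated in the text.
INITIAL = Step(94, 5 * 60)
CYCLE = [Step(94, 30), Step(55, 30), Step(72, 3 * 60)]
FINAL = Step(72, 7 * 60)

if __name__ == "__main__":
    total = program_seconds(INITIAL, CYCLE, 30, FINAL)
    print(f"{total} s = {total / 60:.0f} min of programmed holds")
```

The 3 minute extension step dominates the run time; the programs used elsewhere in these Examples, with 30 second to 1 minute extensions, are correspondingly shorter.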

iii) Screening of Genomic and cDNA Libraries

After obtaining this additional sequence information, several genomic and
cDNA libraries were screened to identify any libraries that contain this telomerase candidate
gene. The approach used, as well as the libraries and results, are shown in Figure 60. In this
Figure, Panel A lists the libraries tested in this experiment; Panel B shows the regions used;
Panels C and D show the dot blot hybridization results obtained with these libraries. Positive
libraries were then screened by colony hybridization to obtain genomic and cDNA versions of
the tez1 gene. In this experiment, approximately 3 x 10^4 colonies from the HindIII genomic
library were screened and six positive clones were identified (approximately 0.01%). DNA
was then prepared from two independent clones (A5 and B2). Figure 61 shows the results
obtained with the HindIII-digested A5 and B2 positive genomic clones.

In addition, cDNA REP libraries were used. Approximately 3 x 10^5 colonies
were screened, and 5 positive clones were identified (0.002%). DNA was prepared from
three independent clones (2-3, 4-1, and 5-20). In later experiments, it was determined that
clones 2-3 and 5-20 contained identical inserts.



iv) 5' RT PCR

As the cDNA version of the gene produced to this point was not complete, 5'
RT-PCR was conducted to obtain a full length clone. The strategy is schematically shown in
Figure 62. In this experiment, cDNA was prepared using DNA oligonucleotide primers
"M2-B" (5'-CAC TGA TCC TTT CTT TTT CGT AAA CGA TAG GT-3') and "M2-B2" (5'-C
ATC AAT CAA ATC TTC CAT ATA GAA ATG ACA-3'), designed from known regions of
tez1 identified previously. An oligonucleotide linker, PCR Adapt SfiI, with a phosphorylated
5' end ("P") (P-GGG CCG TGT TGG CCT AGT TCT CTG CTC-3') was then ligated at the 3'
end of this cDNA, and this construct was used as the template for nested PCR. In the first
round of PCR, PCR Adapt SFI and M2-B were used as the primers, while PCR Adapt SfiII
(5'-GAG GAG GAG AAG AGC AGA GAA CTA GGC CAA CAC GCC CC-3') and M2-B2
were used as primers in the second round. Nested PCR was used to increase the specificity of
the reaction.

v) Sequence Alignments

Once the sequence of tez1 was identified, it was compared with sequences
previously described. Figure 63 shows the alignment of RT domains from telomerase
catalytic subunits of S. pombe ("S.p. Tez1p"), S. cerevisiae ("S.c. Est2p"), and E. aediculatus
p123 ("E.a. p123"). In this Figure, "h" indicates hydrophobic residues, while "p" indicates
small polar residues, and "c" indicates charged residues. The amino acid residues indicated
above the alignment show a known consensus RT motif of Y. Xiong and T.H. Eickbush (Y.
Xiong and T.H. Eickbush, EMBO J., 9: 3353-3362 [1990]). The asterisks indicate the
residues that are conserved for all three proteins. "Motif 0" is identified herein and in Figure
63 as a motif specific to this telomerase subunit and not found in reverse transcriptases in
general. It is therefore valuable in identifying other amino acid sequences as telomerase
catalytic subunits.

Figure 64 shows the alignment of entire sequences from Euplotes
("Ea_p123"), S. cerevisiae ("Sc_Est2p"), and S. pombe ("Sp_Tez1p"). In Panel A, the shaded
areas indicate residues shared between two sequences. In Panel B, the shaded areas indicate
residues shared between all three sequences.

vi) Genetic Disruption of tez1

In this Example, the effects of disruption of tez1 were investigated. As
telomerase is involved in telomere maintenance, it was hypothesized that if tez1 were indeed
a telomerase component, disruption of tez1 would cause gradual telomere shortening.

In these experiments, homologous recombination was used to disrupt the tez1
gene in S. pombe specifically. This approach is schematically illustrated in Figure 65. As
indicated in Figure 65, wild type tez1 was replaced with a fragment containing the ura4 or
LEU2 marker.

The disruption of the tez1 gene was confirmed by PCR (Figure 66), and a
Southern blot was performed to check for telomere length. Figure 67 shows the Southern
blot results for this experiment. Because an ApaI restriction enzyme site is present
immediately adjacent to telomeric sequence in S. pombe, ApaI digestion of S. pombe genomic
DNA preparations permits analysis of telomere length. Thus, DNA from S. pombe was
digested with ApaI and the digestion products were run on an agarose gel and probed with a
telomeric sequence-specific probe to determine whether the telomeres of disrupted S. pombe
cells were shortened. The results are shown in Figure 67. From these results, it was clear
that disruption of the tez1 gene caused a shortening of the telomeres.

Q. Cloning and Characterization of Human Telomerase Protein and cDNA

In this Example, the nucleic and amino acid sequence information for human
telomerase was determined. Partial homologous sequences were first identified in a BLAST
search conducted using the Euplotes 123 kDa peptide and nucleic acid sequences, as well as
Schizosaccharomyces protein and corresponding cDNA (tez1) sequences. The human
sequences (also referred to as "hTCP1.1") were identified from a partial cDNA clone (clone
712562). Sequences from this clone were aligned with the sequences determined as described
in previous Examples.

Figure 1 shows the sequence alignment of the Euplotes ("p123"),
Schizosaccharomyces ("tez1"), Est2p (i.e., the S. cerevisiae protein encoded by the Est2
nucleic acid sequence, and also referred to herein as "L8543.12"), and the human homolog
identified in this comparison search. Figure 51 shows the amino acid sequence of tez1, while
Figure 52 shows the DNA sequence of tez1. In Figure 52, the introns and other non-coding
regions are shown in lower case, while the exons (i.e., coding regions) are shown in upper
case.

As shown in the Figures, there are regions that are highly conserved among
these proteins. For example, as shown in Figure 1, there are regions of identity in "Motif 0,"
"Motif 1," "Motif 2," and "Motif 3." The identical amino acids are indicated with an asterisk
(*), while the similar amino acid residues are indicated by a circle (•). This indicates that
there are regions within the telomerase motifs that are conserved among a wide variety of
eukaryotes, ranging from yeast to ciliates to humans. It is contemplated that additional
organisms will likewise contain such conserved regions of sequence. Figure 49 shows the
partial amino acid sequence of the human telomerase motifs, while Figure 50 shows the
corresponding DNA sequence.

Sanger dideoxy sequencing and other methods were used, as known in the art,
to obtain complete sequence information of clone 712562. Some of the primers used in the
sequencing are shown in Table 7. These primers were designed to hybridize to the clone,
based on sequence complementarity to either plasmid backbone sequence or the sequence of
the human cDNA insert in the clone.

Table 7. Primers

Primer      Sequence
TCP1.1      GTGAAGGCACTGTTCAGCG
TCP1.2      GTGGATGATTTCTTGTTGG
TCP1.3      ATGCTCCTGCGTTTGGTGG
TCP1.4      CTGGACACTCAGCCCTTGG
TCP1.5      GGCAGGTGTGCTGGACACT
TCP1.6      TTTGATGATGCTGGCGATG
TCP1.7      GGGGCTCGTCTTCTACAGG
TCP1.8      CAGCAGGAGGATCTTGTAG
TCP1.9      TGACCCCAGGAGTGGCACG
TCP1.10     TCAAGCTGACTCGACACCG
TCP1.11     CGGCGTGACAGGGCTGC
TCP1.12     GCTGAAGGCTGAGTGTCC
TCP1.13     TAGTCCATGTTCACAATCG

From these experiments, it was determined that the EcoRI-NotI insert of clone
712562 contains only a partial open reading frame for the human telomerase protein, although
it may encode an active fragment of that protein. The open reading frame in the clone
encodes an approximately 63 kD protein. The sequence of the longest open reading frame
identified is shown in Figure 68. The ORF begins at the ATG codon with the "met" indicated
in the Figure. The poly A tail at the 3' end of the sequence is also shown. Figure 69 shows a
tentative, preliminary alignment of telomerase reverse transcriptase proteins from the human
sequence (human Telomerase Core Protein 1, "Hs TCP1"), E. aediculatus p123 ("Ep p123"),
S. pombe tez1 ("Sp Tez1"), S. cerevisiae EST2 ("Sc Est2"), and consensus sequence. In this
Figure various motifs are indicated.

To obtain a full-length clone, probing of a cDNA library and 5'-RACE were
used to obtain clones encoding portions of the previously uncloned regions. In these
experiments, RACE (Rapid Amplification of cDNA Ends; See e.g., M.A. Frohman, "RACE:
Rapid Amplification of cDNA Ends," in Innis et al. (eds), PCR Protocols: A Guide to
Methods and Applications [1990], pp. 28-38; and Frohman et al., Proc. Natl. Acad. Sci.,
85:8998-9002 [1988]) was used to generate material for sequence analysis. Four such clones
were generated and used to provide additional 5' sequence information (pFWRP5, 6, 19, and
20).

In addition, human cDNA libraries (inserted into lambda) were probed with
the EcoRI-NotI fragment of the clone. One lambda clone, designated "lambda 25-1.1"
(ATCC accession #209024), was identified as containing complementary sequences. Figure
75 shows a restriction map of this lambda clone. The human cDNA insert from this clone
was subcloned as an EcoRI restriction fragment into the EcoRI site of commercially available
phagemid pBluescriptIISK+ (Stratagene), to create the plasmid "pGRN121," which was
deposited with the ATCC (ATCC accession #209016). Preliminary results indicated that
plasmid pGRN121 contains the entire open reading frame (ORF) sequence encoding the
human telomerase protein.

The cDNA insert of plasmid pGRN121 was sequenced using techniques
known in the art. Figure 70 provides a restriction site and function map of plasmid pGRN121
identified based on this preliminary work. The results of this preliminary sequence analysis
are shown in Figure 71. From this analysis, and as shown in Figure 70, a putative start site
for the coding region was identified at approximately 50 nucleotides from the EcoRI site
(located at position 707), and the location of the telomerase-specific motifs, "FFYVTE,"
"PKP," "AYD," "QG," and "DD," were identified, in addition to a putative stop site at
nucleotide #3571 (See, Figure 72, which shows the DNA and corresponding amino acid
sequences for the open reading frames in the sequence ["a," "b," and "c"]). However, due to
the preliminary nature of the early sequencing work, the reading frames for the various motifs
were found not to be in alignment.

Additional analysis conducted on the pGRN121 indicated that the plasmid
contained significant portions from the 5'-end of the coding sequence not present on clone
712562. Furthermore, pGRN121 was found to contain a variant coding sequence that
includes an insert of approximately 182 nucleotides. This insert was found to be absent from
the clone. As with the E. aediculatus sequences, such variants can be tested in functional
assays, such as telomerase assays to detect the presence of functional telomerase in a sample.

Further sequence analysis resolved the cDNA sequence of pGRN121 to
provide a contiguous open reading frame that encodes a protein of molecular weight of
approximately 127,000 daltons, and 1132 amino acids as shown in Figure 74. A refined map
of pGRN121 based on this analysis is provided in Figure 73. The results of additional
sequence analysis of the hTRT cDNA are presented in Figure 16 (SEQUENCE ID NO: 1).

EXAMPLE 2 

CORRELATION OF hTRT ABUNDANCE AND CELL IMMORTALITY

The relative abundance of hTRT mRNA was assessed in six telomerase- 
negative mortal cell strains and six telomerase-positive immortal cell lines (Figure 5). The 
steady state level of hTRT mRNA was significantly increased in immortal cell lines that had 
previously been shown to have active telomerase. Lower levels of the hTRT mRNA were 
detected in some telomerase-negative cell strains. 

RT-PCR for hTRT, hTR, TP1 (telomerase-associated protein related to 
Tetrahymena p80 [Harrington et al., 1997, Science 275:973; Nakayama et al., 1997, Cell 
88:875]) and GAPDH (to normalize for equal amounts of RNA template) was carried out on 
RNA derived from the following cells: (1) human fetal lung fibroblasts GFL, (2) human fetal 
skin fibroblasts GFS, (3) adult prostate stromal fibroblasts 31 YO, (4) human fetal knee 
synovial fibroblasts HSF, (5) neonatal foreskin fibroblasts BJ, (6) human fetal lung 
fibroblasts IMR90, and immortalized cell lines: (7) melanoma LOX IMVI, (8) leukemia 
U251, (9) NCI H23 lung carcinoma, (10) colon adenocarcinoma SW620, (11) breast tumor
MCF7, (12) 293 adenovirus E1 transformed human embryonic kidney cell line.

hTRT nucleic acid was amplified from cDNA using oligonucleotide primers
LT5 and LT6 (Table 2) for a total of 31 cycles (94°C 45 s, 60°C 45 s, 72°C 90 s). GAPDH
was amplified using primers K136 (5'-CTCAGACACCATGGGGAAGGTGA) and K137
(5'-ATGATCTTGAGGCTGTTGTCATA) for a total of 16 cycles (94°C
45 s, 55°C 45 s, 72°C 90 s). hTR was amplified using primers F3b
(5'-TCTAACCCTAACTGAGAAGGGCGTAG) and R3c (5'-GTTTGCTCTAGAATGAACGGTGGAAG)
for a total of 22 cycles (94°C 45 s, 55°C 45 s, 72°C 90 s). TP1 mRNA was
amplified using primers TP1.1 and TP1.2 for 28 cycles (cycling conditions the same as for
hTRT). Reaction products were resolved on an 8% polyacrylamide gel, stained with SYBR
Green (Molecular Probes) and visualized by scanning on a Storm 860 (Molecular Dynamics).
The results, shown in Figure 5, demonstrate that hTRT mRNA levels correlate directly with
telomerase activity levels in the cells tested.

EXAMPLE 3

CHARACTERIZATION OF AN hTRT INTRONIC SEQUENCE

A putative intron was first identified by PCR amplification of human genomic
DNA, as described in this example, and subsequently confirmed by sequencing the genomic
clone λGΦ5 (see Example 4). PCR amplification was carried out using the forward primer
TCP1.57 paired individually with the reverse primers TCP1.46, TCP1.48, TCP1.50,
TCP1.52, TCP1.54, TCP1.56, and TCP1.58 (see Table 2). The products from genomic DNA
of the TCP1.57/TCP1.46, TCP1.48, TCP1.50, TCP1.52, TCP1.54, or TCP1.56 amplifications
were approximately 100 basepairs larger than the products of the pGRN121 amplifications.
The TCP1.57/TCP1.58 amplification product was the same size on either genomic or
pGRN121 DNA. This indicated the genomic DNA contained an insertion between the sites
for TCP1.58 and TCP1.50. The PCR products of TCP1.57/TCP1.50 and TCP1.57/TCP1.52
were sequenced directly, without subcloning, using the primers TCP1.39, TCP1.57, and
TCP1.49.

As shown below, the 104-base intronic sequence (SEQUENCE ID NO: 7) is
inserted in the hTRT mRNA (shown in bold) at the junction corresponding to bases 274 and
275 of Figure 16:

CCCCCCGCCGCCCCCTCCTTCCGCCAG/GTGGGCCTCCCCGGGGTCGGCGTCCG 

GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAG 

GCGACTCAGGGCGCTTCCCCCGCAG/GTGTCCTGCCTGAAGGAGCTGGTGGCC 

CGAGTGCTGCAG 

The "/" indicates the splice junctions; the sequence shows good matches to consensus 5' and
3' splice site sequences typical for human introns.
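
The splice-junction annotation above can be checked mechanically. The following is a minimal sketch, not part of the original disclosure: it splits the sequence as printed at the two "/" marks and checks that the intervening segment begins with GT and ends with AG, the dinucleotides that bound most human introns, and that its length agrees with the 104 bases stated in the text. The names `split_intron` and `has_gt_ag_boundaries` are hypothetical helpers introduced only for illustration.

```python
# Sequence as printed in this example; "/" marks the two splice junctions.
ANNOTATED = (
    "CCCCCCGCCGCCCCCTCCTTCCGCCAG/GTGGGCCTCCCCGGGGTCGGCGTCCG"
    "GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAG"
    "GCGACTCAGGGCGCTTCCCCCGCAG/GTGTCCTGCCTGAAGGAGCTGGTGGCC"
    "CGAGTGCTGCAG"
)


def split_intron(annotated):
    """Return (upstream exon, intron, downstream exon) from a '/'-annotated sequence."""
    upstream, intron, downstream = annotated.split("/")
    return upstream, intron, downstream


def has_gt_ag_boundaries(intron):
    """True if the intron starts with GT and ends with AG (the common GT-AG rule)."""
    return intron.startswith("GT") and intron.endswith("AG")


upstream_exon, intron, downstream_exon = split_intron(ANNOTATED)
print("intron length:", len(intron))
print("GT...AG boundaries:", has_gt_ag_boundaries(intron))
```

The check only tests the terminal dinucleotides; it does not score the wider splice-site consensus mentioned in the text.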

This intron contains motifs characteristic of a topoisomerase II cleavage site
and an NFκB binding site (see Figure 21). These motifs are of interest, in part, because
expression of topoisomerase II is up regulated in most tumors. It functions to relax DNA by
cutting and rewinding the DNA, thus increasing expression of particular genes. Inhibitors of
topoisomerase II have been shown to work as anti-tumor agents. In the case of NFκB, this
transcription factor may play a role in regulation of telomerase during terminal
differentiation, such as in early repression of telomerase during development, and so is
another target for therapeutic intervention to regulate telomerase activity in cells.

EXAMPLE 4 

CLONING OF LAMBDA PHAGE GΦ5 AND CHARACTERIZATION OF hTRT
GENOMIC SEQUENCES

A. Lambda GΦ5

A human genomic DNA library was screened by PCR and hybridization to
identify a genomic clone containing hTRT RNA coding sequences. The library was a human
fibroblast genomic library made using DNA from WI38 lung fibroblast cells (Stratagene, Cat
# 946204). In this library, partial Sau3AI fragments are ligated into the XhoI site of Lambda
FIX®II Vector (Stratagene), with an insert size of 9-22 kb.

The genomic library was divided into pools of 150,000 phage each, and each
pool screened by nested PCR (outer primer pair TCP1.52 & TCP1.57; inner pair TCP1.49 &
TCP1.50, see Table 1). These primer pairs span a putative intron (see Example 3, supra) in
the genomic DNA of hTRT and ensured the PCR product was derived from a genomic source
and not from contamination by the hTRT cDNA clone. Positive pools were further
subdivided until a pool of 2000 phage was obtained. This pool was plated at low density and
screened via hybridization with a DNA fragment encompassing basepairs 1552-2108 of
Figure 16 (restriction sites SphI and EcoRV, respectively).

Two positive clones were isolated and rescreened via nested PCR as described
above; both clones were positive by PCR. One of the clones (λGΦ5) was digested with NotI,
revealing an insert size of approximately 20 kb. Subsequent mapping (see below) indicated
the insert size was 15 kb and that phage GΦ5 contains approximately 13 kb of DNA upstream
from the start site of the cDNA sequence.

Phage GΦ5 was mapped by restriction enzyme digestion and DNA
sequencing. The resulting map is shown in Figure 7. The phage DNA was digested with
NcoI and the fragments cloned into pBBS167. The resulting subclones were screened by
PCR to identify those containing sequence corresponding to the 5' region of the hTRT cDNA.
A subclone (pGRN140) containing a 9 kb NcoI fragment (with hTRT gene sequence and 4-5
kb of lambda vector sequence) was partially sequenced to determine the orientation of the
insert. pGRN140 was digested using SalI to remove lambda vector sequences, resulting in
pGRN144. pGRN144 was then sequenced. The results of the sequencing are provided in
Figure 21. The 5' end of the hTRT mRNA corresponds to base 2441 of Figure 21. As
indicated in Figure 7, two Alu sequence elements are located 1700 base pairs upstream of the
hTRT cDNA 5' end and provide a likely upstream limit to the promoter region of hTRT. The
sequence also reveals an intron positioned at base 4173 in Figure 21, 3' to the intron
described in Example 3, supra.

B. Additional Genomic Clones 

In addition to the genomic clone described above, two P1 bacteriophage
clones and one human BAC clone are provided as illustrative embodiments of the invention.
P1 inserts are usually 75-100 kb, and BAC inserts are usually over 100 kb.

The P1 clones (DMPC-HFF#1-477(F6) -GS #15371 and
DMPC-HFF#1-1103(H6) -GS #15372) were obtained by PCR screening of a human P1
library derived from human foreskin fibroblast cells (Shepherd et al., 1994, PNAS USA
91:2629) using primers TCP1.12 and UTR2 which amplify the 3' end of hTRT. These clones
were both negative (failed to amplify) with primers that amplify the 5' end of hTRT.

The human BAC clone (326 E 20) was obtained with a hybridization screen of
a BAC human genomic library using an 1143 bp SphI/XmnI fragment of pGRN121 (Figure
16; bases 1552-2695) that encompasses the RT motif region. The clone is believed to include
the 5' end of the gene. The hTRT genomic clones in this example are believed to encompass
the entire hTRT gene.

EXAMPLE 5

CHROMOSOMAL LOCATION OF hTRT GENE

The hTRT gene was localized to chromosome 5p by radiation hybrid mapping
(Boehnke et al., 1991, Am J Hum Genet 49:1174; Walter et al., 1994, Nature Genet 7:22)
using the medium resolution Stanford G3 panel of 83 RH clones of the whole human genome
(created at the Stanford Human Genome Center). A human lymphoblastoid cell line (donor;
rM) was exposed to 10,000 rad of x-rays and was then fused with nonirradiated hamster
recipient cells (A3). Eighty-three independent somatic cell hybrid clones were isolated, and
each represents a fusion event between an irradiated donor cell and a recipient hamster cell.
The panel of G3 DNA was used for ordering markers in the region of interest as well as
establishing the distance between these markers.

The primers used for the RH mapping were TCP1.12 and UTR2 with
amplification conditions of 94°C 45 sec, 55°C 45 sec, 72°C 45 sec, for 45 cycles using
Boehringer Mannheim Taq buffer and Perkin-Elmer Taq. The 83 pools were amplified
independently and 14 (17%) scored positive for hTRT (by appearance of a 346 bp band). The
amplification results were submitted to the Stanford RH server, which then provided the map
location, 5p, and the closest marker, STS D5S678.
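
The 17% figure quoted above is simply the fraction of the 83 hybrids that gave the 346 bp product. A minimal sketch of that arithmetic follows; the function name `retention_percent` is a hypothetical helper, and only the counts 14 and 83 come from the text.

```python
def retention_percent(positive, total):
    """Percentage of radiation hybrid clones scoring positive for a marker."""
    return 100.0 * positive / total


# Counts reported in this example: 14 positive clones out of the 83-clone G3 panel.
pct = retention_percent(14, 83)
print(f"{pct:.1f}% of hybrids scored positive for hTRT")
```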

By querying the Genethon genome mapping web site, the map location 
identified a YAC that contains the STS marker D5S678: CEPH YAC 780_C_3 Size: 
390,660 kb. This YAC also contained chromosome 17 markers. This result indicated that 
the hTRT gene is on chromosome 5, near the telomeric end. There are increased copy 
numbers of 5p in a number of tumors. Cri-du-chat syndrome also has been mapped to 
deletions in this region. 

EXAMPLE 6 

DESIGN AND CONSTRUCTION OF VECTORS FOR EXPRESSION OF hTRT
PROTEINS AND POLYNUCLEOTIDES

Expression of hTRT in Bacteria

The following portion of this example details the design of hTRT-expressing 
bacterial and eukaryotic cell expression vectors to produce large quantities of full-length, 
biologically active hTRT. Generation of biologically active hTRT protein in this manner is 
useful for telomerase reconstitution assays, assaying for telomerase activity modulators, 
analysis of the activity of newly isolated species of hTRT, identifying and isolating 
compounds which specifically associate with hTRT, analysis of the activity of an hTRT 
variant protein that has been site-specifically mutated, and as an immunogen, as a few 
examples. 

pThioHis A/hTRT Bacterial Expression Vector

To produce large quantities of full-length hTRT, the bacterial expression 
vector pThioHis A (Invitrogen, San Diego, CA) was selected as an expression system. The 
hTRT-coding insert includes nucleotides 707 to 4776 of the hTRT insert in the plasmid 
pGRN121. This nucleotide sequence includes the complete coding sequence for the hTRT 
protein. 

This expression vector of the invention is designed for inducible expression in 
bacteria. The vector can be induced to express, in E. coli, high levels of a fusion protein 
composed of a cleavable, HIS tagged thioredoxin moiety and the full length hTRT protein. 
The use of the expression system was in substantial accordance with the manufacturer's 
instructions. The amino acid sequence of the fusion protein encoded by the resulting vector 
of the invention is shown below; (-*-) denotes an enterokinase cleavage site: 
MSDKIIHLTDDSFDTDVLKADGAILVDFWAHWCGPCKMIAPILDEIADEYQGKLTVAKLRID
HNPGTAPKYGIRGIPTLLLFKNGEVAATKVGALSKGQLKEFLDANLAGSGSGDDDDK-*-VP
MHELEIFEFAAASTQRCVLLRTWEALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRR
LGPQGWRLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERG 
AKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLA 
RCALFVLVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGL 
PAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCWSPARP 
AEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYSSGDKEQ 
LRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLELLGNHAQ 
CPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQLLRQHSSPWQVYGF 
VRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGV 
GCVPAAEHRLREEILAKFLHWLMSVYWELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIG
IRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYWGARTFRREKR
AERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVD
VTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAWQKAAHGHVRKAFKSHVSTLTDLQPYMR
QFVAHLQETSPLRDAWIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCW
NLRKTWNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTF
NRGFKAGRNMRRKLFGVLRLKCTSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFH
QQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTR
HRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD

pGEX-2TK with hTRT Nucleotides 3272 to 4177 of pGRN121 

This construct of the invention is used to produce fusion protein for, e.g., the
purpose of raising polyclonal and monoclonal antibodies to hTRT protein. Fragments of
hTRT can also be used for other purposes, such as to modulate telomerase activity, for
example, as a dominant-negative mutant or to prevent the association of a telomerase
component with other proteins or nucleic acids.

To produce large quantities of an hTRT protein fragment, the E. coli
expression vector pGEX-2TK (Pharmacia Biotech, Piscataway, N.J.) was selected, and used
essentially according to manufacturer's instructions to make an expression vector of the
invention. The resulting construct contains an insert derived from nucleotides 3272 to 4177
of the hTRT insert in the plasmid pGRN121. The vector directs expression in E. coli of high
levels of a fusion protein composed of glutathione-S-transferase sequence (underlined below),
thrombin cleavage sequence (double underlined), recognition sequence for heart muscle
protein kinase (italicized), residues introduced by cloning in brackets ([GSVTK]) and hTRT
protein fragment (in bold) as shown below:

VTCT .TOSMAI TT? YT ADKHNMT iCiGCPKERA E T SMLEGAVT iD IRYGVSR I AYSKPFETLKVPFLg
K"T .PTCMT .KMFEDRLCHKTYLNGDHVTHPDFMT .YDALD WLYMDPMCLDAFPKLVCFKKRX EM 
POTDTCYI.KSSTCYTAWPLQnWOATFGGGD^PPTCSDLVPRGS i^AgVCGSVTKl IPQGSILSTL 
LCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRK 
TVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASVTFNRGF 

KAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVW
KNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVT 
YVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD

When this fusion protein was expressed, it formed insoluble aggregates. It
was treated generally as described above, in the section entitled purification of proteins from
inclusion bodies. Specifically, induced cells were suspended in PBS (20 mM sodium
phosphate, pH 7.4, 150 mM NaCl) and disrupted by sonication. NP-40 was added to 0.1%,
and the mixture was incubated for 30 minutes at 4°C with gentle mixing. The insoluble
material was collected by centrifugation at 25,000g for 30 minutes at 4°C. The insoluble
material was washed once in 4M urea in PBS, collected by centrifugation, then washed again
in PBS. The collected pellet was estimated to contain greater than 75% fusion protein. This
material was dried in a speed vacuum, then suspended in adjuvant for injection into mice and
rabbits for the generation of antibodies. Separation of the recombinant protein from the
glutathione S-transferase moiety is accomplished by site-specific proteolysis using thrombin
according to manufacturer's instructions.

pGEX-2TK with hTRT Nucleotides 2426 to 3274 of pGRN121 with HIS-8 Tag

To produce large quantities of a fragment of hTRT, another E. coli expression
vector pGEX-2TK construct was prepared. This construct contains an insert derived from
nucleotides 2426 to 3274 of the hTRT insert in the plasmid pGRN121 and a sequence
encoding eight consecutive histidine residues (HIS-8 Tag). To insert the HIS-8 Tag, the
pGEX-2TK vector with hTRT nucleotides 2426 to 3274 of pGRN121 was linearized with
BamHI. This opened the plasmid at the junction between the GST-thrombin-heart muscle
protein kinase and the hTRT coding sequence. A double stranded oligonucleotide with
BamHI compatible ends was ligated to the linearized plasmid resulting in the in-frame
introduction of eight histidine residues upstream of the hTRT sequence.

The vector directs expression in E. coli of high levels of a fusion protein
composed of glutathione-S-transferase sequence (underlined); thrombin cleavage sequence
(double underlined); recognition sequence for heart muscle protein kinase (italicized); a set of
three and a set of five residues introduced by cloning (in brackets, [GSV] and [GSVTK]);
eight consecutive histidines (also double underlined); and hTRT protein fragment (in bold):
MSPTTr T VWTKGT.VnPTRTJXEYLF.F.KYFPmYF -RnFrTDKWNKKFFI,GLEFPNLPY 
VTnfTTWKT TOSMATTT? YT ADKHNMT .OGCPKERA FISMLEGAVT ,DTRYGVSRT AYSKDF 

F.TT .K VDFLSKT .PF.MT JCMFFDR T .CHKTYT .NGDHVTHPDFMT ,YDALDVVT,YMDPM CL
D A FPKT VGFKKRTEATPOTDK YLKSSK YT A WPT ,OG WO ATFGGGDHPPK SDT .VPRGS-Ri? 
^SnGSV]^^ffiF^[GSVTK]MSVYVVELLRSFFYVTETTFQK]>^FFYRPSVWS 
KLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMD 
YWGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDDIHRAW 

RTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYA
WQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWIEQSSSL 

NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGI 

Each of the pGEX-2TK vectors of the invention can be used to produce fusion
protein for the purpose of raising polyclonal and monoclonal antibodies to hTRT protein.
Additionally, this fusion protein can be used to affinity purify antibodies raised to hTRT
peptides that are encompassed within the fusion protein. Separation of the recombinant
protein from the glutathione S-transferase moiety can be accomplished by site-specific
proteolysis using thrombin according to manufacturer's instructions.

pGEX-2TK with hTRT Nucleotides 2426 to 3274 of pGRN121, no HIS-8 Tag

To produce large quantities of a fragment of hTRT, another E. coli expression 
vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 2426 to 3274 of the
hTRT insert in the plasmid pGRN121, but without the HIS-8 tag of the construct described
above. The vector directs expression in E. coli of high levels of a fusion protein composed of
glutathione-S-transferase (underlined), thrombin cleavage sequence (double underlined),
recognition sequence for heart muscle protein kinase (italicized), residues introduced by
cloning in brackets ([GSVTK]) and hTRT protein fragment (in bold):

M.qPTT,nVWTCTTCm,VnPTRTJJ,KVT,EEKV^KmYERD KGnKWRNKKFELGLEFP NLPYYXPgD
VTfT.TngMATTPYTAPyT^T.r,flr!PK TC PaF.T.qMT,F.aAVT,DTRYGVSFTAY.qKDFF.TLKVDFLS 
yT.PBMT.TCMPKnRLCtTTCTYT .NflnTTV T HPnFMLYDAT .DVyLYMPPMCT .DAFPKLVCFKKRIEAI 
pnTnTryLKSSyYTAWPTOCTn&TPC^DP PPTCfiDLVPRGSRRAgyCGSVTK] MSVYWELLR 
SFFYVTETTFQKNRLFPYRPSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSR 

LRFIPKPDGLRPIVNMDYWGARTFRREKRAERLTSRKALFSVLNYERARRPGLLGASVLGL
DDIHRAVTOTFVLRVRAQDPPPEYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYCVRRYAV 
VQKAAHGVRKAFKSHVSTLTDLQP YMRQFVAHLQETSPLRDAVVIEQS S S LNEAS GLFDVFL 
RFMCHHAVRIRGKSYVQCQGI 

pGEX-2TK with hTRT Nucleotides 1625 to 2458 of pGRN121

To produce large quantities of a fragment of hTRT protein, another E. coli 
expression vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 1625 to 2458 of the
hTRT insert in the plasmid pGRN121. The vector directs expression in E. coli of high levels
of a fusion protein composed of glutathione-S-transferase (underlined), thrombin cleavage
sequence (double underlined), recognition sequence for heart muscle protein kinase
(italicized), residues introduced by cloning in brackets ([GSVTK]) and hTRT protein
fragment (in bold):

MSPTT .G YWKTK GL VOPTRTXLE YLFF.K YF.FHT ,YE R T")F.GDK WRNKKFFJ ,GLEFPNLP Y 
VTDGnVKLTOSM ATTR YTADKHNM T XtGCP K 'F.K AF.TSMT ,FG A VT ,DTRYGVSRI A YSKDF






F.TT .KVDFLSKT PFMT,KMFFr)R T .flHKTYI .NGnHVTHPD FMT ,YD ALD WLYMDPMCL 

nAFPKT.VCFKKRTFATPOTDKYT-TCSSKYTAW PT.OGWOATFGGrTDHPPKSDLVPRGS^ 

^5F[GSVTK]ATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAE 

TKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRL 

PQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGS 

VAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRF 

LRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEI 

LAKFLHWLMSVYWELLRS 



pGEX-2TK with hTRT Nucleotides 782 to 1636 of pGRN121

To produce large quantities of a fragment of hTRT protein, another E. coli 
expression vector pGEX-2TK construct was prepared. 

This construct contains an insert derived from nucleotides 782 to 1636 of the
hTRT insert in the plasmid pGRN121. The vector directs expression in E. coli of high levels
of a fusion protein composed of glutathione-S-transferase (underlined), thrombin cleavage
sequence (double underlined), recognition sequence for heart muscle protein kinase
(italicized), residues introduced by cloning in brackets ([GSVTK]) and hTRT protein
fragment (in bold):

M.qPTT ,C,VWK"nreT .VOPTPT ,T.T ■F.YT/EEKVF.Fm.YER D EGnKWBNKTCFF.T i(TT iEFPNTiPYYIPGD 
WT/rnfiMATTPVTAnKHNMT/y:rPXF.^^ 

TCT.PF.MT.KMFBn^T.rHKTYT.Mnnm/THPn FMT.YDAL DW T . YTIDPMC^P AFPKT.VCFKKRIEAI 
pnTDTCVT J KSSTCVTAWPT,OGW nATFr,P?r,DHPPTCSDLVPRGSRRAgy[GSvTK] MPRAPRCRAV 
RSLLSHYREVLPLATFVRRLGPQGWRLVQRGDPAAFRALVAQCLVCVPWDARPPAAPSFRQV 
SCLKELVARVXQRLCERGAKNvTiAFGFALLDGARGGPPEATTSVRSYLPNTVTDALRGSGAW 
GLLLRRVGDDVLVHLLARCALFVLVAPCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCE 
RAWNHSVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRT 

RGPSDRGFCWSPARPAEEATSL 



pT7FLhTRT with hTRT cDNA Lacking 5'-Non-Coding Sequence

As described above, in one embodiment, the invention provides for an hTRT 
that is modified in a site-specific manner to facilitate cloning into bacterial, mammalian, yeast 
and insect expression vectors without any 5' untranslated hTRT sequence. In some 
circumstances, minimizing the amount of non-protein encoding sequence allows for 
improved protein production (yield) and increased mRNA stability. In this embodiment of 
the invention, the hTRT gene's 5' non-coding region was removed before cloning into a 
bacterial expression vector. 

This was effected by engineering an additional restriction endonuclease site
just upstream (5') to the start (ATG) codon of the hTRT coding sequence (Figure 16). The
creation of a restriction site just 5' to the coding region of the protein allows for efficient
production of a wide variety of vectors that encode fusion proteins, such as fusion proteins
comprising labels and peptide TAGs, for immunodetection and purification.

Specifically, the oligonucleotide
5'-CCGGCCACCCCCCATATGCCGCGCGCTCCC-3' was used as described above to
modify nucleotides 779 to 781 of the hTRT cDNA (Figure 16) from GCG to
CAT. These 3 nucleotides are the last nucleotides before the ATG start codon so they do not
modify the protein sequence. The change in sequence results in the creation of a unique NdeI
restriction site in the hTRT cDNA. Single-stranded hTRT DNA was used as a DNA source
for the site directed mutagenesis. The resulting plasmid was sequenced to confirm the
success of the mutagenesis.
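
The design logic of this oligonucleotide can be illustrated with a short script. This is a sketch for illustration only, not the procedure used in the example: it locates the NdeI recognition sequence (CATATG) in the oligonucleotide as printed, takes the ATG embedded in that site as the start codon, and translates the downstream bases with the standard genetic code to show that the first residues of hTRT are unchanged. The helper `translate` and the variable names are hypothetical.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Mutagenic oligonucleotide as printed in this example.
MUTAGENIC_OLIGO = "CCGGCCACCCCCCATATGCCGCGCGCTCCC"
NDEI_SITE = "CATATG"

site_start = MUTAGENIC_OLIGO.find(NDEI_SITE)
start_codon = site_start + 3          # the ATG is the last three bases of CATATG
encoded = translate(MUTAGENIC_OLIGO[start_codon:])

print("NdeI site found:", site_start != -1)
print("encoded N-terminus:", encoded)
```

Because the three altered bases lie immediately 5' of the ATG, the translated product printed by the script begins with the same residues as the hTRT sequences listed earlier in this example.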

This modification allowed the construction of the following plasmid of the
invention, designated pT7FLhTRT. The site-specifically modified hTRT sequence (addition
of the NdeI restriction site) was digested with NdeI and NotI (and filled in with Klenow
enzyme to generate blunt ended DNA) to generate an hTRT encoding nucleic acid fragment.
The fragment was then cloned into a pSL3418 plasmid previously restriction digested with
NdeI and SmaI (also a blunt ended cutter). pSL3418 is a modified pAED4 plasmid into
which a FLAG sequence (Immunex Corp, Seattle WA) and an enterokinase sequence are
inserted just upstream from the above-referenced NdeI site. This plasmid, designated
pT7FLhTRT, allows the expression of full length hTRT (with a Flag-Tag at its 5' end) in an
E. coli strain expressing the T7 RNA polymerase.

Plasmids with hTRT cDNA Lacking 3'-Non-Coding Sequence

As discussed above, the invention provides for expression vectors containing
TRT-encoding nucleic acids in which some or all non-coding sequences have been deleted.
In some circumstances, minimizing the amount of non-protein encoding sequence allows for
improved protein production (yield) and increases mRNA stability. In this embodiment of
the invention, the 3' untranslated region of hTRT is deleted before cloning into a bacterial
expression plasmid.

The plasmid pGRN121, containing the full length hTRT cDNA, as discussed
above, was first deleted of all ApaI sites. This was followed by deletion of the MscI-HincII
hTRT restriction digest enzyme fragment containing the 3'UTR. The NcoI-XbaI restriction
digest fragment containing the stop codon of hTRT was then inserted into the NcoI-XbaI site
of pGRN121 to make a plasmid equivalent to pGRN121, designated pGRN124, except
lacking the 3'UTR.

Bacterial Expression Vectors Using Antibiotic Selection Markers 

The invention also provides for bacterial expression vectors that can contain 
selection markers to confer a selectable phenotype on transformed cells and sequences coding 
for episomal maintenance and replication such that integration into the host genome is not 
required. For example, the marker may encode antibiotic resistance, particularly resistance to 
chloramphenicol (see Harrod (1997) Nucleic Acids Res. 25: 1720-1726), kanamycin, G418, 
bleomycin and hygromycin, to permit selection of those cells transformed with the desired 
DNA sequences, see for example, Blondelet-Rouault (1997) Gene 190:315-317; and Mahan 
(1995) Proc Natl Acad Sci USA 92:669-673. 

In one embodiment of the invention, the full length hTRT was cloned into a
modified BlueScript plasmid vector (Stratagene, San Diego, CA), designated pBBS235, into
which a chloramphenicol antibiotic resistance gene had been inserted. The NotI fragment
from pGRN124 (discussed above) containing the hTRT ORF was subcloned into the NotI site
of pBBS235 so that the TRT ORF is in the opposite orientation of the vector's Lac promoter.
This makes a plasmid that is suitable for mutagenesis of plasmid inserts, such as TRT nucleic
acids of the invention. This plasmid construct, designated pGRN125, can be used in the
methods of the invention involving mutagenesis of telomerase enzyme and TRT protein
coding sequences and for in vitro transcription of hTRT using the T7 promoter (and in vitro
transcription of antisense hTRT using the T3 promoter).

In another embodiment of the invention, NotI restriction digest fragments from
pGRN124 containing the hTRT ORF were subcloned into the NotI site of pBBS235
(described above) so the TRT ORF is in the same orientation as the vector's Lac promoter.
This makes a plasmid, designated pGRN126, that can be used for expression of full length
hTRT in E. coli. The expressed product will contain 29 amino acids encoded by the vector
pBBS235, followed by 18 amino acids encoded by the 5'UTR of hTRT, followed by the full
length hTRT protein.

In a further embodiment of the invention, in vitro mutagenesis of pGRN125
was done to convert the hTRT initiating ATG codon into a Kozak consensus and create
EcoRI and BglII restriction digest sites to facilitate cloning into expression vectors. The
oligonucleotide
5'-TGCGCACGTGGGAAGCCCTGGCagatctgAattCcaCcATGCCGCGCGCTCCCCGCTG-3'
(altered nucleotides in lower case) was used in the mutagenesis procedure. The resulting
expression vector was designated pGRN127.
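
As an illustrative check of the oligonucleotide design, the sketch below searches the sequence as printed for the BglII (AGATCT) and EcoRI (GAATTC) recognition sequences and inspects the base at position -3 relative to the ATG, where a purine is the most strongly conserved feature of the Kozak consensus. The function `locate_sites` and the variable names are hypothetical and are not part of the described method.

```python
# Mutagenic oligonucleotide as printed above; lower case marks altered bases.
OLIGO = "TGCGCACGTGGGAAGCCCTGGCagatctgAattCcaCcATGCCGCGCGCTCCCCGCTG"

# Recognition sequences of the two enzymes named in the text.
SITES = {"BglII": "AGATCT", "EcoRI": "GAATTC"}


def locate_sites(oligo, sites):
    """Return {enzyme: 0-based index of first match, or -1} on the upper-cased oligo."""
    seq = oligo.upper()
    return {name: seq.find(motif) for name, motif in sites.items()}


positions = locate_sites(OLIGO, SITES)

# The initiating ATG lies downstream of the EcoRI site in this oligonucleotide.
atg_index = OLIGO.upper().find("ATG", positions["EcoRI"])
kozak_minus3 = OLIGO.upper()[atg_index - 3]   # a purine (A or G) is expected here

# Count of bases flagged as altered (lower case) in the printed sequence.
altered = sum(1 for base in OLIGO if base.islower())

print(positions, "ATG at", atg_index, "base at -3:", kozak_minus3, "altered:", altered)
```

The order of the sites (BglII, then EcoRI, then the ATG) means that either enzyme can be used to excise the coding sequence together with its new translation initiation context.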

In another embodiment of the invention, the second Asp of the TRT "DD"
motif was converted to an alanine to create a non-functional telomerase enzyme, thus creating
a mutant TRT protein for use as a dominant/negative mutant. The hTRT coding sequence
was mutagenized in vitro using the oligonucleotide
5'-CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTGTTGGTGACACCTCACCTCACC-3'
to convert the aspartic acid codon for residue 869 (Asp869) to an alanine (Ala)
codon. This also created an MluI restriction enzyme site. The resulting expression plasmid
was designated pGRN130, which also contains the Kozak consensus sequence as described
for pGRN127.
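
The effect of this oligonucleotide can be illustrated by translating it and comparing the result with the corresponding wild-type residues. The sketch below is an illustration under stated assumptions, not the mutagenesis protocol itself: it assumes the oligonucleotide as printed is in the hTRT reading frame from its first base, takes the wild-type peptide from the full-length hTRT sequence listed earlier in this example, and uses a hypothetical helper `translate` built on the standard genetic code.

```python
from itertools import product

# Standard genetic code, built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring any trailing partial codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Mutagenic oligonucleotide as printed above (lower case = altered bases).
OLIGO = "CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTGTTGGTGACACCTCACCTCACC"
# Wild-type residues over the same stretch, taken from the hTRT listing above.
WILD_TYPE = "RDGLLLRLVDDFLLVTPHLT"
MLUI_SITE = "ACGCGT"

mutant = translate(OLIGO)
# Positions (0-based within the peptide) where mutant and wild type differ.
differences = [(i, w, m) for i, (w, m) in enumerate(zip(WILD_TYPE, mutant)) if w != m]
has_mlui = MLUI_SITE in OLIGO.upper()

print("mutant peptide:", mutant)
print("differences:", differences)
print("MluI site present:", has_mlui)
```

Only the second aspartate of the "DD" pair differs between the two peptides; the other lower-case base in the oligonucleotide is a silent change that completes the MluI site.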

The invention also provides a vector designed to express an antisense
sequence fragment of hTRT. The pGRN126 plasmid was cut to completion with MscI and
SmaI restriction enzymes and religated to delete over 95% of the hTRT ORF. One
SmaI-MscI fragment was re-inserted during the process to recreate CAT activity. This
unpurified plasmid was then redigested with SalI and EcoRI and the fragment containing the
initiating codon of the hTRT ORF was inserted into the SalI-EcoRI sites of pBBS212 to
make an antisense expression plasmid expressing the antisense sequence spanning the 5'UTR
and 73 base pairs of the hTRT ORF (in mammalian cells). This plasmid was
designated pGRN135.

Expression of hTRT Telomerase in Yeast

The present invention also provides hTRT-expressing yeast expression vectors 
to produce large quantities of full-length, biologically active hTRT. 

Pichia pastoris Expression Vector pPICZ B and Full Length hTRT

To produce large quantities of full-length, biologically active hTRT, the Pichia
pastoris expression vector pPICZ B (Invitrogen, San Diego, CA) was selected. The
hTRT-coding sequence insert was derived from nucleotides 659 to 4801 of the hTRT insert in
plasmid pGRN121. This nucleotide sequence includes the full-length sequence encoding
hTRT. This expression vector is designed for inducible expression in P. pastoris of high
levels of full-length, unmodified hTRT protein. Expression is driven by a yeast promoter,
but the expressed sequence utilizes the hTRT initiation and termination codons. No
exogenous codons were introduced by the cloning. The resulting pPICZ B/hTRT vector was
used to transform the yeast.


Pichia pastoris Expression Vector hTRT-His6/pPICZ B 

A second Pichia pastoris expression vector of the invention, derived from
pPICZ B, also contains the full-length sequence encoding hTRT derived from nucleotides
659 to 4801 of the hTRT insert in the plasmid pGRN121. This hTRT-His6/pPICZ B
expression vector encodes full length hTRT protein fused at its C-terminus to the Myc
epitope and His6 reporter tag sequences. The hTRT stop codon has been removed and
replaced by vector sequences encoding the Myc epitope and the His6 reporter tag as well as a
stop codon. This vector is designed to direct high-level inducible expression in yeast of the
following fusion protein, which consists of hTRT sequence (underlined), vector sequences in
brackets ([L] and [NSAVD]), the Myc epitope (double underlined), and the His6 tag
(italicized):

MPT?APRCRAVRSLT-PSHYRHVLPT-ATFVRRT.G POrTWRT.VORGDPAAFRALVAQCLV 
PVP WD A R PPP A APSFRO VSri -'KF.T ,V A R VT ,ORT .CERGAKNVL AFGFAI J .DGARGQPP 
F. A FTTS VR S YT J'NTVTD ALRGSfi A WGT J ,LRR VGDDVLVHT ,T , ARC ALFVT ,V APSCAY 
OVCGPPLYOT .G A ATO ARPPPH A SGPRRRT .CrCFR A WNHS VR F, AGVPLGT ,P APGARRR
fifiS A £R ST ,PT ,PKRPRR OA APF.PF,RTP VrrOGS WAHPGRTRGPSDRGFCVVSP ARPAEE, 
ATSTF.r T AT.SGTRHSHPSVr T ROHHAGP P STSRPPRPWDTPCPPVYAETKFrFLYSSGPK 
F.OT .R PSFT J ,S SLRPST TG ARRL VF.TTFT .GSRPW MPGTPRRLPR T ,POR YWOMRPLFLEL 
TGNHAOrPYrTVLI.KTHCPT.RAAVTPAAGVCARRKPOGSVAAPEEEDTDPRRLVOLL
R OHSSPWOVYGFVR ACLRRLVPPGT .WGSRHNERRFLRNTKK FISmKHAKLSLOELT 
WKMSVRDCAWI.RRSPGVGCVPAAEHRT.REEILAKFLHWLMSVYVVELLRSFFYVT 
F.TTFOTCNR T ,FF YRKS VWSKLQSIGTR OHT ,KR VO T ,RFJ ,SE AEVROHREARP ALLTSRLR 
FTPKPDGLRPIVMMnYVVGARTFRREKRAF .RTTSRVKAr.FSVLNYERARRPGLLGAS
VT GT DDTFm A WRTFVT ,R VR A ODPPPET .YFVK VDVTGA YDTIPO DRLTEVIASIIKPON 
TYT VR R YAWOKA AHGFTVRK AFK SFTVSTLTDT ,OP YMROFV AHLOETSPT ,RD A WIE 
OSSST.NF.ASSGT.FDVFT.RFMCHHAVRTRGKSYVOCOGIPOGSILSTLLCSLCYGDMEN 
TCT.FAGmRDGLLLRJ-VDnFTXVTPHrTHAKTFLRTT.VRGVPEY GCVVNT.RKTVVNFP 
VEDEAT.GGTAFVOMPAHGT.FPWCGLLI.nTRTT.EVOSDYSSYARTSIRASLTFNRGFK
A GRMMR R TCT .FGVLR T .KCHSLFLDJ .OVNST .OTVC TNT YKTT J J ,Q A YRFHACVLOLPFH 
GOV WTCNPTFFLRVISDT A ST ,C YSII.K A KN A GMST ,G AKGAAGPLPSEAVOWLCHO AF 
T T .TgT TRHR VTYVPLT GST .RTAOTOT .SRKT.PGTTT TAT M A A AN P AT.PSDFKTILDrLlEO 
KLTSEEDLn^ SAVD WHHHHH" 
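
As a rough illustration of how the C-terminal portion of this fusion is laid out, the following Python sketch joins the tag elements in the order described above. The Myc epitope is written as its standard ten-residue sequence EQKLISEEDL. The variable names, and the choice to show only the last hTRT residues (PSDFKTILD, as in the sequences printed elsewhere in this section), are assumptions made for the example and are not part of the construct description.

```python
# Sketch only: assemble the C-terminal tail of the hTRT-His6/pPICZ B fusion.
HTRT_C_TERMINUS = "PSDFKTILD"   # last hTRT residues (hTRT stop codon removed)
LINKER_1 = "L"                  # vector-encoded residue, shown as [L]
MYC_EPITOPE = "EQKLISEEDL"      # Myc epitope tag
LINKER_2 = "NSAVD"              # vector-encoded residues, shown as [NSAVD]
HIS6_TAG = "H" * 6              # six-histidine reporter tag


def build_tail():
    """Return the tag region that follows the final hTRT residue."""
    return LINKER_1 + MYC_EPITOPE + LINKER_2 + HIS6_TAG


tail = build_tail()
fusion_end = HTRT_C_TERMINUS + tail
print(fusion_end)
```

Keeping each element as a separate constant makes it easy to see which residues come from hTRT, which from the vector, and which from the two tags.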

Expression of hTRT in Insect Cells 

The present invention also provides hTRT telomerase-expressing insect cell 
expression vectors that produce large quantities of full-length, biologically active hTRT. 



Baculovirus Expression Vector pVL1393 and Full Length hTRT

The telomerase coding sequence of interest was cloned into the baculovirus
expression vector pVL1393 (Invitrogen, San Diego, CA). This construct was subsequently
cotransfected into Spodoptera frugiperda (Sf-9) cells with linearized DNA from Autographa
californica nuclear polyhedrosis virus (Baculogold-AcMNPV). The recombinant
baculoviruses obtained were subsequently plaque purified and expanded following standard
protocols.

This expression vector provides for expression in insect cells of high levels of 
full-length hTRT protein. Expression is driven by a baculoviral polyhedrin gene promoter. 
No exogenous codons were introduced by the cloning.

Baculovirus Expression Vector pBlueBacHis2 B and Full Length hTRT

To produce large quantities of full-length, biologically active hTRT, the
baculovirus expression vector pBlueBacHis2 B (Invitrogen, San Diego, CA) was selected as a
source of control elements. The hTRT-coding insert consisted of nucleotides 707 to 4776 of
the hTRT insert in plasmid pGRN121.

A full-length hTRT with His6 and Anti-Xpress tags (Invitrogen) was also
constructed. This vector also contains an insert consisting of nucleotides 707 to 4776 of the
hTRT insert from the plasmid pGRN121. The vector directs expression in insect cells of high
levels of full-length hTRT protein fused to cleavable 6-histidine and Anti-Xpress tags, and
the amino acid sequence of the fusion protein is shown below; (-*-) denotes the enterokinase
cleavage site:

MPRGSHHHHHHGMASMTGGQQMGRDLYDDDDL-*-DPSSRSAAGTMEFAAA
STQRCVLLRTWEALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGW 
RLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERG 
AKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDV
LVHLLARCALFVLVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWN 
HSVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGR 
TRGPSDRGFCWSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDT 
PCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRR 
LPRLPQRYWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGS 
VAAPEEEDTDPRRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRN 
TKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLH 
WLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSE 
AEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKA 
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGA 
YDTIPQDRLTEVIASIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYM 
RQFVAHLQETSPLRDAWIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGI 
PQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVR
GVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSD
YSSYARTSIRASLTFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIY
KILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAK 
GAAGPLPSEAVQWLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALE 

AAANPALPSDFKTILD 

Baculovirus Expression Vector pBlueBac4.5 and Full Length hTRT Protein

To produce large quantities of full-length, biologically active hTRT, a second
baculovirus expression vector, pBlueBac4.5 (Invitrogen, San Diego, CA), was constructed.
The hTRT-coding insert also consisted of nucleotides 707 to 4776 of the hTRT insert from the
plasmid pGRN121.

Baculovirus Expression Vector pMelBacB and Full Length hTRT Protein

To produce large quantities of full-length, biologically active hTRT, a third
baculovirus expression vector, pMelBacB (Invitrogen, San Diego, CA), was constructed. The
hTRT-coding insert also consists of nucleotides 707 to 4776 of the hTRT insert from the
plasmid pGRN121.

pMelBacB directs expression of full length hTRT in insect cells to the
extracellular medium through the secretory pathway using the melittin signal sequence. High
levels of full length hTRT are thus secreted. The melittin signal sequence is cleaved upon
secretion, but remains part of the protein pool that is retained intracellularly. For that reason, it is
indicated in parentheses in the following sequence. The sequence of the fusion protein
encoded by the vector is shown below:

(MKFLVNVALWMVVYISYIYA)-*-DPSSRSAAGTMEFAAASTQRCVLLRTWE 
ALAPATPAMPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDPAAFR 
ALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFALL
DGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFV 
LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGL 
PAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCW 
SPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHF 
LYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQM 
RPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPR 
RLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAK
LSLQELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELL
RSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPA 
LLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRP 
GLLGASVLGLDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIA 
SIIKPQNTYCVRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPL
RDAVVIEQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLC 
YGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLR 
KTVVNFPVEDEALGGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASL 
TFNRGFKAGRNMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHAC
VLQLPFHQQVWKNPTFFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQ 
WLCHQAFLLKLTRHRVTYVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFK 

TILD 
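
The notation used above, with the signal peptide in parentheses and (-*-) marking the cleavage point, can be read mechanically. The short Python sketch below splits such an annotated string into the cleaved leader and the processed product. The function name is hypothetical, and the example uses only the opening residues of the sequence as printed above.

```python
def split_at_cleavage(annotated, marker="-*-"):
    """Split 'LEADER-*-PRODUCT' into (leader, product).

    Whitespace is removed and parentheses around the leader are dropped.
    """
    compact = "".join(annotated.split())
    leader, sep, product = compact.partition(marker)
    if not sep:
        raise ValueError("no cleavage marker found")
    return leader.strip("()"), product


# Opening of the pMelBacB fusion as printed above (truncated for brevity).
annotated = "(MKFLVNVALWMVVYISYIYA)-*-DPSSRSAAGTMEFAAASTQRCVLLRTWE"
signal, secreted = split_at_cleavage(annotated)
print(len(signal), secreted[:10])
```

The same helper applies to the pBlueBacHis2 B fusion, where the marker denotes the enterokinase cleavage site rather than signal peptide removal.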



Expression of hTRT in Mammalian Cells 

The present invention also provides vectors to produce hTRT in large 
quantities as full-length, biologically active protein in a variety of mammalian cell lines, 
which is useful in many embodiments of the invention, as discussed above.

MPSV-hTRT Expression Plasmids

The invention also provides for an expression system for use in mammalian 
cells that gives the highest possible expression of recombinant protein, such as telomerase, 
without actually modifying the coding sequence (e.g. optimizing codon usage). In one 
embodiment, the invention provides MPSV mammalian expression plasmids (from plasmid 
pBBS212, described as pMPSV-TM in Lin J-H (1994) Gene 47:287-292) capable of 
expressing the TRTs of the invention. The MPSV plasmids can be expressed either as stable 
or transient clones. 

In this expression system, while the hTRT coding sequence itself is 
unchanged, exogenous transcriptional control elements are incorporated into the vector. The 
myeloproliferative sarcoma virus (MPSV) LTR (MPSV-LTR) promoter, enhanced by the 
cytomegalovirus (CMV) enhancer, is incorporated for transcriptional initiation. This 
promoter consistently shows higher expression levels in cell lines (see Lin J-H (1994) supra). 
A Kozak consensus sequence can be incorporated for translation initiation (see Kozak (1996)
Mamm. Genome 7:563-574). All extraneous 5' and 3' untranslated hTRT sequences can be
removed to ensure that these sequences do not interfere with expression, as discussed above.
The MPSV plasmid containing the complete hTRT coding sequence, with all extraneous
sequences included, is designated pGRN133. A control hTRT "antisense" plasmid was also
constructed. This vector is identical to pGRN133 except that the TRT insert is the antisense
sequence of hTRT (this antisense vector, which can be used as a control, is designated
pGRN134). The MPSV plasmid containing the complete hTRT coding sequence with all
other extraneous sequences removed and containing the Kozak consensus sequence is
designated pGRN145.

Two selection markers, PAC (Puromycin-N-acetyl-transferase = Puromycin 
resistance) and HygB (Hygromycin B = Hygromycin resistance) are present for selection of 
the plasmids after transfection (see discussion referring to selectable markers, above). 
Double selection using markers on both sides of the vector polylinker should increase the 
stability of the hTRT coding sequence. A DHFR (dihydrofolate reductase) encoding 
sequence is included to allow amplification of the expression cassette after stable clones are 
made. Other means of gene amplification can also be used to increase recombinant protein
yields.

The invention also provides for MPSV mammalian expression plasmids
containing hTRT fusion proteins. In one embodiment, the hTRT sequence, while retaining its
5' untranslated region, is linked to an epitope flag, such as the IBI FLAG (International
Biotechnologies Inc. (IBI), Kodak, New Haven, CT) and inserted into the MPSV expression
plasmid (designated pGRN147). This particular construct contains a Kozak translation
initiation site. The expressed fusion protein can be purified using the M-1 anti-FLAG
octapeptide monoclonal antibody (IBI, Kodak, supra).

In another embodiment, hTRT is site-specifically altered. One amino acid
residue codon is mutagenized, changing the aspartic acid at position 869 to an alanine. This
Asp869->Ala hTRT mutant, retaining its 5' untranslated region and incorporating a Kozak
sequence, was inserted into an MPSV expression plasmid, and designated pGRN146. The
Asp869->Ala hTRT mutant was further engineered to contain the FLAG sequence, as
described above, and the insert cloned into an MPSV expression plasmid. One such
expression plasmid is designated pGRN154-1. Specifically, for pGRN154-1, an Eam1105I
restriction digest fragment from pGRN146 containing the Kozak sequence-containing "front
end" (5' segment) of hTRT is cloned into the Eam1105I sites of pGRN147 (see above) to
make an MPSV expression plasmid capable of expressing hTRT with a Kozak sequence, the
above-described D869->A mutation, and the IBI flag.

Another embodiment of the invention is an expression plasmid derived from
pGRN146. The mammalian expression plasmid, designated pGRN152, was generated by
excising the EcoRI fragment from plasmid pGRN146 (containing the hTRT ORF) and cloning
it into the EcoRI site of pBBS212 to remove the 5'UTR of hTRT. The hTRT is oriented so that
its expression is controlled by the MPSV promoter. This makes a mammalian expression
plasmid that expresses hTRT with a Kozak consensus sequence and the D869->A mutation,
and uses the MPSV promoter.

The invention provides for a mammalian expression vector in which hTRT is
oriented so that the hTRT coding sequence is driven by the MPSV promoter. For example,
an EcoRI restriction digest fragment from pGRN137 containing the hTRT open reading
frame (ORF) was cloned into the EcoRI site of pBBS212 (see below), thus removing the 5'
untranslated region (5'-UTR) of hTRT. pGRN137 was constructed by excising a
SalI-Sse8387I fragment from pGRN130, described below, containing the Kozak mutation of
hTRT and inserting it into the SalI-Sse8387I sites of pGRN136, making a mammalian
expression plasmid expressing hTRT containing a Kozak consensus sequence off the MPSV
promoter. (Plasmid pGRN136 was constructed by excising a HindIII-SalI fragment from
pGRN126 containing the hTRT ORF and cloning it into the HindIII-SalI sites of plasmid
pBBS242, making a mammalian expression plasmid expressing hTRT off the MPSV
promoter.) The EcoRI cloning step makes a mammalian expression plasmid, designated
pGRN145, that expresses hTRT with a Kozak consensus sequence using the MPSV
promoter. See also the pGRN152 MPSV promoter-driven mammalian expression vector
described below.

hTRT Expressed in 293 Cells using Episomal Vector pEBVHis 

An episomal vector, pEBVHis (Invitrogen, San Diego, CA), was engineered to
express an hTRT fusion protein comprising hTRT fused to an N-terminal extension epitope
tag, the Xpress epitope (Invitrogen, San Diego, CA) (designated pGRN122). The NotI hTRT
fragment from pGRN121 containing the hTRT ORF was cloned into the NotI site of
pEBVHisA so that the hTRT ORF is in the same orientation as the vector's Rous Sarcoma
Virus (RSV) promoter. In this orientation the His6 flag was relatively closer to the
N-terminus of hTRT.

A vector was also constructed containing as an insert the antisense sequence of
hTRT and the epitope tag (the plasmid designated pGRN123, which can be used as a control).
The vector was transfected into 293 cells and translated hTRT identified and isolated using an
antibody specific for the Xpress epitope. pEBVHis is a hygromycin resistant EBV episomal
vector that expresses the protein of interest fused to an N-terminal peptide. Cells carrying the
vector are selected and expanded, then nuclear and cytoplasmic extracts prepared. These and
control extracts are immunoprecipitated with anti-Xpress antibody, and the
immunoprecipitated beads are tested for telomerase activity by conventional assay.

Expression of Recombinant hTRT in Mortal, Normal Diploid Human Cells

In one embodiment of the invention, recombinant hTRT and necessary
telomerase enzyme complex components can be expressed in normal, diploid mortal cells to
increase their proliferative capacity or to immortalize them, or to facilitate immortalizing
them. This allows one to obtain diploid immortal cells with an otherwise normal phenotype
and karyotype. As discussed above, this use of telomerase has enormous commercial utility.

Sense hTRT (Figure 16) and antisense hTRT were cloned into a CMV vector.
These vectors were purified and transiently transfected into two normal, mortal, diploid
human cell clones. The human clones were young passage diploid human BJ and IMR90 cell
strains.

Analysis of telomerase activity using a TRAP assay (utilizing the TRAPeze™
Kit, Oncor, Inc., Gaithersburg, MD) showed that transfection of sense hTRT, but not
antisense hTRT, generated telomerase activity in both the BJ and IMR90 cell strains.



Expression of Recombinant hTRT in Immortalized IMR90 Human Cells

Using the same hTRT sense construct cloned into CMV vectors used in the above
described diploid human BJ and IMR90 cell strain studies, an immortalized SW13 ALT
pathway cell line (an IMR90 cell immortalized with SV40 antigen) was transiently
transfected. A TRAP assay (TRAPeze, Oncor, Inc., Gaithersburg, MD) demonstrated that
telomerase activity was generated in the sense construct transfected cells.

Vectors for Regulated Expression of hTRT in Mammalian Cells: Inducible and 
Repressible Expression of hTRT 

The invention provides vectors that can be manipulated to induce or repress
the expression of the TRTs of the invention, such as hTRT. For example, the hTRT coding
sequence can be cloned into the Ecdysone-Inducible Expression System from Invitrogen (San
Diego, CA) and the Tet-On and Tet-Off tetracycline regulated systems from Clontech
Laboratories, Inc. (Palo Alto, CA). Such inducible expression systems are provided for use in
the methods of the invention where it is important to control the level or rate of transcription
of transfected TRT. For example, the invention provides for cell lines immortalized through
the expression of hTRT; such cells can be rendered "mortal" by inhibition of hTRT
expression by the vector through transcriptional controls, such as those provided by the
Tet-Off system. The invention also provides for methods of expressing TRT only transiently
to avoid the constitutive expression of hTRT, which may lead to unwanted "immortalization"
of the transfected cells, as discussed above.

The Ecdysone-Inducible Mammalian Expression System is designed to allow
regulated expression of the gene of interest in mammalian cells. The system is distinguished
by its tightly regulated mechanism that allows almost no detectable basal expression and
greater than 200-fold inducibility in mammalian cells. The expression system is based on the
heterodimeric ecdysone receptor of Drosophila. The Ecdysone-Inducible Expression System
uses a steroid hormone ecdysone analog, muristerone A, to activate expression of hTRT via a
heterodimeric nuclear receptor. Expression levels have been reported to exceed 200-fold over
basal levels with no effect on mammalian cell physiology (see "Ecdysone-Inducible Gene
Expression in Mammalian Cells and Transgenic Mice" (1996) Proc. Natl. Acad. Sci. USA
93:3346-3351). Once the receptor binds ecdysone or muristerone, an analog of ecdysone, the
receptor activates an ecdysone-responsive promoter to give controlled expression of the gene
of interest. In the Ecdysone-Inducible Mammalian Expression System, both monomers of the
heterodimeric receptor are constitutively expressed from the same vector, pVgRXR. The
ecdysone-responsive promoter, which ultimately drives expression of the gene of interest, is
located on a second vector, pIND.

The hTRT coding sequence is cloned in the pIND vector (Clontech
Laboratories, Inc., Palo Alto, CA), which contains 5 modified ecdysone response elements
(E/GREs) upstream of a minimal heat shock promoter and the multiple cloning site. The
construct is then transfected in cell lines which have been pre-engineered to stably express the
ecdysone receptor. After transfection, cells are treated with muristerone A to induce
intracellular expression from pIND.

The Tet-On and Tet-Off expression systems (Clontech, Palo Alto, CA) give
access to the regulated, high-level gene expression systems described by Gossen (1992)
"Tight control of gene expression in mammalian cells by tetracycline responsive promoters"
Proc. Natl. Acad. Sci. USA 89:5547-5551, for the Tet-Off transcription repression system;
and Gossen (1995) "Transcriptional activation by tetracycline in mammalian cells" Science
268:1766-1769, for the Tet-On inducible transcriptional system. In Tet-Off transformed
cell lines, gene expression is turned on when tetracycline (Tc) or doxycycline ("Dox"; a Tc
derivative) is removed from the culture medium. In contrast, expression is turned on in
Tet-On cell lines by the addition of Tc or Dox to the medium. Both systems permit
expression of cloned genes to be regulated closely in response to varying concentrations of Tc
or Dox.
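
The opposite responses of the two systems to Tc or Dox can be summarized in a few lines of Python. This is only a logical sketch of the on/off behavior described above. The function name is illustrative, and the dose-dependent response is not modeled.

```python
def expression_on(system, dox_present):
    """Return True if the TRE-driven gene is expected to be expressed.

    system: "Tet-Off" (expressed when Tc/Dox is absent) or
            "Tet-On"  (expressed when Tc/Dox is present).
    """
    if system == "Tet-Off":
        return not dox_present
    if system == "Tet-On":
        return dox_present
    raise ValueError(f"unknown system: {system!r}")


for system in ("Tet-Off", "Tet-On"):
    for dox in (False, True):
        state = "on" if expression_on(system, dox) else "off"
        print(f"{system:7s} Dox={'+' if dox else '-'} -> expression {state}")
```

For an hTRT-immortalized line built on Tet-Off, adding Dox corresponds to the case that returns False, which is the condition under which the cells would be rendered "mortal" as discussed above.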

This system uses "pTRE" as a response plasmid that can be used to
express a gene of interest. Plasmid pTRE contains a multiple cloning site (MCS)
immediately downstream of the Tet-responsive PhCMV*-1 promoter. Genes or cDNAs of
interest inserted into one of the sites in the MCS will be responsive to the tTA and rtTA
regulatory proteins in the Tet-Off and Tet-On systems, respectively. PhCMV*-1 contains the
Tet-responsive element (TRE), which consists of seven copies of the 42-bp tet operator
sequence (tetO). The TRE element is just upstream of the minimal CMV promoter
(PminCMV), which lacks the enhancer that is part of the complete CMV promoter in the pTet
plasmids. Consequently, PhCMV*-1 is silent in the absence of binding of regulatory proteins
to the tetO sequences. The cloned insert must have an initiation codon. In some cases,
addition of a Kozak consensus ribosome binding site may improve expression levels;
however, many cDNAs have been efficiently expressed in Tet systems without the addition of
a Kozak sequence. pTRE-Gene X plasmids are cotransfected with pTK-Hyg to permit
selection of stable transfectants.

Setting up a Tet-Off or Tet-On expression system generally requires two
consecutive stable transfections to create a "double-stable" cell line that contains integrated
copies of genes encoding the appropriate regulatory protein and TRT under the control of a
TRE. In the first transfection, the appropriate regulatory protein is introduced into the cell line
of choice by transfection of a "regulator plasmid" such as the pTet-Off or pTet-On vector, which
expresses the appropriate regulatory proteins. The hTRT cloned in the pTRE "response
plasmid" is then introduced in the second transfection to create the double-stable Tet-Off or
Tet-On cell line. Both systems give very tight on/off control of gene expression, regulated
dose-dependent induction, and high absolute levels of gene expression.

Expression of Recombinant hTRT With DHFR and Adenovirus Sequences

The pGRN155 plasmid construct was designed for transient expression of
hTRT cDNA in mammalian cells. A Kozak consensus is inserted at the 5' end of the hTRT
sequence. The hTRT insert contains no 3' or 5' UTR. The hTRT cDNA is inserted into the
EcoRI site of p91023(B) (Wong (1985) Science 228:810-815). The hTRT insert is in the
same orientation as the DHFR ORF.

Plasmid pGRN155 contains the SV40 origin and enhancer just upstream of an
adenovirus promoter, a tetracycline resistance gene, an E. coli origin and an adenovirus VAI
and VAII gene region. This expression cassette contains, in the following order: the
adenovirus major late promoter; the adenovirus tripartite leader; a hybrid intron consisting of
a 5' splice site from the first exon of the tripartite leader and a 3' splice site from the mouse
immunoglobulin gene; the hTRT cDNA; the mouse DHFR coding sequence; and the SV40
polyadenylation signal.

The adenovirus tripartite leader and the VA RNAs have been reported to
increase the efficiency with which polycistronic mRNAs are translated. DHFR sequences
have been reported to enhance the stability of hybrid mRNA. DHFR sequences also can
provide a marker for selection and amplification of vector sequences. See Logan (1984)
Proc. Natl. Acad. Sci. USA 81:3655; Kaufman (1985) Proc. Natl. Acad. Sci. USA 82:689;
and Kaufman (1988) Focus (Life Technologies, Inc.), Vol. 10, no. 3. This makes the
expression vector particularly useful for transient expression.

Other expression plasmids of the invention are described for illustrative
purposes.

pGRN121

The EcoRI fragment from lambda clone 25-1.1.6 containing the entire cDNA encoding hTRT 
protein was inserted into the EcoRI site of pBluescriptIISK+ such that the 5' end of the cDNA 
is near the T7 promoter in the vector. The selectable marker that is used with this vector is 
ampicillin. 

pGRN122

The NotI fragment from pGRN121 containing the hTRT ORF was inserted into the NotI site
of pEBVHisA so that the coding sequence is operably linked to the RSV promoter. This
plasmid expresses a fusion protein composed of a His6 flag fused to the N-terminus of the
hTRT protein. The selectable marker that is used with this vector is ampicillin or
hygromycin.

pGRN123 

The NotI fragment from pGRN121 containing the hTRT ORF was inserted into the NotI site 
of pEBVHisA so that the coding sequence is in the opposite orientation as the RSV promoter, 
thus expressing antisense hTRT. 
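
Inserting the fragment in the opposite orientation means the promoter transcribes the other strand, so the transcript corresponds to the reverse complement of the coding sequence. A minimal Python sketch of that relationship follows. The 18-nucleotide example is the start of the hTRT ORF as it appears in the mutagenic oligonucleotides listed in this section, and the function name is an assumption for illustration.

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return dna.upper().translate(COMPLEMENT)[::-1]


sense_start = "ATGCCGCGCGCTCCCCGC"   # first six codons of the hTRT ORF
antisense = reverse_complement(sense_start)
print(antisense)
```

Applying the function twice returns the original sequence, which is a quick sanity check when comparing sense and antisense constructs such as pGRN122 and pGRN123.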

pGRN124 

Plasmid pGRN121 was deleted of all ApaI sites followed by deletion of the MscI-HincII
fragment containing the 3'UTR. The NcoI-XbaI fragment containing the stop codon of the
hTRT coding sequence was then inserted into the NcoI-XbaI sites of pGRN121 to make a
plasmid equivalent to pGRN121 except lacking the 3'UTR, which may be preferred for
increased expression levels in some cells.

pGRN125 

The NotI fragment from pGRN124 containing the hTRT coding sequence was inserted into 
the NotI site of pBBS235 so that the open reading frame is in the opposite orientation of the 
Lac promoter. The selectable marker that is used with this vector is chloramphenicol. 

pGRN126 

The NotI fragment from pGRN124 containing the hTRT coding sequence was inserted into 
the NotI site of pBBS235 so that the hTRT coding sequence inserted is in the same 
orientation as the Lac promoter. 

pGRN127 

The oligonucleotide 5'-TGCGCACGTGGGAAGCCCTGGCagatctgAattCCaCcATGC
CGCGCGCTCCCCGCTG-3' was used in in vitro mutagenesis of pGRN125 to convert the
initiating ATG codon of the hTRT coding sequence into a Kozak consensus sequence and
create EcoRI and BglII sites for cloning. Also, oligonucleotide COD2866 was used to
convert AmpS to AmpR (ampicillin resistant) and oligonucleotide COD1941 was used to
convert CatR (chloramphenicol resistant) to CatS (chloramphenicol sensitive).
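
The features that this oligonucleotide introduces can be checked directly from its sequence. The Python sketch below searches the oligonucleotide for the BglII (AGATCT) and EcoRI (GAATTC) recognition sequences and inspects the base upstream of the ATG. The helper names are hypothetical, and the only Kozak feature tested is the purine at position -3, which is a simplification of the full consensus.

```python
# Mutagenic oligonucleotide from the pGRN127 entry; lowercase marks changed bases.
OLIGO = ("TGCGCACGTGGGAAGCCCTGGCagatctgAattCCaCcATGC"
         "CGCGCGCTCCCCGCTG")

SITES = {"BglII": "AGATCT", "EcoRI": "GAATTC"}


def find_sites(seq, sites=SITES):
    """Map each enzyme name to the 0-based index of its first site (-1 if absent)."""
    seq = seq.upper()
    return {name: seq.find(pattern) for name, pattern in sites.items()}


def kozak_minus3_is_purine(seq):
    """True if the base three positions upstream of the first ATG is A or G."""
    seq = seq.upper()
    atg = seq.find("ATG")
    return atg >= 3 and seq[atg - 3] in "AG"


print(find_sites(OLIGO))
print(kozak_minus3_is_purine(OLIGO))
```

Both restriction sites lie upstream of the ATG, which is consistent with their stated purpose of allowing the ORF to be excised together with its new initiation context.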

pGRN128 

The oligonucleotide 5'-TGCGCACGTGGGAAGCCCTGGCagatctgAattCCaCcATG
CCGCGCGCTCCCCGCTG-3' is used in in vitro mutagenesis to convert the initiating ATG
codon of hTRT into a Kozak consensus and create EcoRI and BglII sites for cloning. Also,
oligo 5'-CTGCCCTCAGACTTCAAGACCATCCTGGACTACAA
GGACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCC
AGC-3' is used to insert the IBI Flag (International Biotechnologies Inc. (IBI), Kodak, New
Haven, CT) at the C-terminus and create EcoRI and BglII sites for cloning. Also, COD2866
is used to convert AmpS to AmpR and COD1941 is used to convert CatR to CatS.

pGRN129 

The oligonucleotide 5'-CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTG
TTGGTGACACCTCACCTCACC-3' was used in in vitro mutagenesis to convert Asp869 to
an Ala codon (i.e., the second Asp of the DD motif was converted to an alanine to create a
dominant/negative hTRT mutant). This also created an MluI site. Also, oligonucleotide 5'-
CTGCCCTCAGACTTCAAGACCATCCTGGACTACAAGG
ACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCCAG
C-3' was used to insert the IBI Flag at the C-terminus and create EcoRI and BglII sites for
cloning. Also, COD2866 was used to convert AmpS to AmpR and COD1941 was used to
convert CatR to CatS.
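
Translating the Asp869 mutagenic oligonucleotide shows both changes it carries. The sketch below uses a compact encoding of the standard genetic code, written for this example rather than taken from any library, to check that the oligonucleotide encodes Asp-Ala in place of the Asp-Asp pair and that the same base changes create an MluI site (ACGCGT). The wild-type peptide used for comparison is taken from the hTRT sequences printed earlier in this section. The reading frame (frame 0) is an assumption.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate DNA in frame 0; a trailing partial codon is ignored."""
    dna = dna.upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


OLIGO = "CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTTGTTGGTGACACCTCACCTCACC"
WILD_TYPE = "RDGLLLRLVDDFLLVTPHLT"   # hTRT residues spanning the DD motif

mutant = translate(OLIGO)
# Collect (index within this window, wild-type residue, mutant residue).
changes = [(i, wt, mt) for i, (wt, mt) in enumerate(zip(WILD_TYPE, mutant)) if wt != mt]
print(mutant)
print(changes)
print("MluI site present:", "ACGCGT" in OLIGO.upper())
```

The single difference falls on the second Asp of the DD pair, in agreement with the description of the mutation, and the lowercase bases in the oligonucleotide are the ones that form the new MluI site.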

pGRN130 

The oligonucleotide 5'-CGGGACGGGCTGCTCCTGCGTTTGGTGGAcGcgTTCTT
GTTGGTGACACCTCACCTCACC-3' was used in in vitro mutagenesis to convert the
Asp869 codon into an Ala codon (i.e., the second Asp of the DD motif was converted to an
alanine to make a dominant/negative variant protein). This also created an MluI site. Also,
the oligonucleotide 5'-TGCGCACGTGGGAAGCCCTGGCagatctgAatt
CCaCcATGCCGCGCGCTCCCCGCTG-3' was used in in vitro mutagenesis to convert the
initiating ATG codon of the hTRT coding sequence into a Kozak consensus sequence and
create EcoRI and BglII sites for cloning. Also, COD2866 was used to convert AmpS to
AmpR and COD1941 was used to convert CatR to CatS.

pGRN131 

The EcoRI fragment from pGRN128 containing the hTRT ORF with Kozak sequence and IBI
Flag mutations is inserted into the EcoRI site of pBBS212 so that the hTRT ORF is expressed
off the MPSV promoter. Plasmid pBBS212 contains an MPSV promoter, the CMV enhancer,
and the SV40 polyadenylation site.

pGRN132 

The EcoRI fragment from pGRN128 containing the hTRT ORF with Kozak sequence and IBI
Flag mutations is inserted into the EcoRI site of pBBS212 so that the antisense of the hTRT
ORF is expressed off the MPSV promoter.

pGRN133 

The EcoRI fragment from pGRN121 containing the hTRT coding sequence was inserted into
the EcoRI site of pBBS212 so that the hTRT protein is expressed under the control of the 
MPSV promoter. 

pGRN134 

The EcoRI fragment from pGRN121 containing the hTRT coding sequence was inserted into
the EcoRI site of pBBS212 so that the antisense of the hTRT coding sequence is expressed 
under the control of the MPSV promoter. The selectable markers used with this vector are 
Chlor/HygB/PAC. 

pGRN135

Plasmid pGRN126 was digested to completion with MscI and SmaI and religated to delete
over 95% of the hTRT coding sequence inserted. One SmaI-MscI fragment was re-inserted
during the process to recreate the Cat activity for selection. This unpurified plasmid was then
redigested with SalI and EcoRI and the fragment containing the initiating codon of the hTRT
coding sequence was inserted into the SalI-EcoRI sites of pBBS212. This makes an antisense
expression plasmid expressing the antisense of the 5'UTR and 73 bases of the coding
sequence. The selectable markers used with this vector are Chlor/HygB/PAC.

pGRN136 

The HindIII-SalI fragment from pGRN126 containing the hTRT coding sequence was
inserted into the HindIII-SalI sites of pBBS242.

pGRN137 

The SalI-Sse8387I fragment from pGRN130 containing the Kozak sequence was inserted into 
the SalI-Sse8387I sites of pGRN136. 

pGRN138 

The EcoRI fragment from pGRN124 containing hTRT minus the 3'UTR was inserted into the
EcoRI site of pEGFP-C2 such that the orientation of the hTRT is the same as the EGFP 
domain. 

pGRN139 

The oligonucleotide 5'-CTGCCCTCAGACTTCAAGACCATCCTGGACTACAAGG
ACGACGATGACAAATGAATTCAGATCTGCGGCCGCCACCGCGGTGGAGCTCCAG
C-3' was used to insert the IBI Flag at the C-terminus of hTRT in pGRN125 and create EcoRI
and BglII sites for cloning. Also, COD2866 was used to convert AmpS to AmpR and
COD1941 was used to convert CatR to CatS.

pGRN140 

The NcoI fragment containing the upstream sequences of genomic hTRT and the first intron
of hTRT from lambdaG55 was inserted into the NcoI site of pBBS167. The fragment is
oriented so that hTRT is in the same direction as the Lac promoter.

pGRN141

The NcoI fragment containing the upstream sequences of genomic hTRT and the first intron
of hTRT from lambdaG55 was inserted into the NcoI site of pBBS167. The fragment is
oriented so that hTRT is in the opposite direction to the Lac promoter.

pGRN142 

The NotI fragment from lambdaGphi5 containing the complete ~15 kbp genomic insert
including the hTRT gene promoter region was inserted in the NotI site of plasmid pBBS185.
The fragment is oriented so that the hTRT ORF is in the opposite orientation to the Lac
promoter.

pGRN143 

The NotI fragment from lambdaGphi5 containing the complete ~15 kbp genomic insert
including the hTRT gene promoter region was inserted in the NotI site of plasmid pBBS185.
The fragment is oriented so that the hTRT ORF is in the same orientation as the Lac
promoter.

pGRN144 

SalI deletion of pGRN140 to remove lambda sequences.

pGRN145 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
EcoRI fragment from pGRN137 containing the hTRT coding sequence was inserted into the 
EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the 5'UTR of
hTRT mRNA. The hTRT coding sequence is oriented so that it is expressed under the
control of the MPSV promoter. The selectable markers used with this vector are 
Chlor/HygB/PAC. 
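
The construction history of pGRN145 spans several of the entries above. The Python sketch below records, for each plasmid, the source of the inserted fragment and the recipient vector as stated in those entries, then walks the chain back to the starting clones. The data structure and function name are assumptions for illustration, and entries whose parent plasmid is not stated in the text (such as pGRN130) are simply treated as starting points.

```python
# (fragment source, recipient vector) for each construct, as described in the text.
PARENTS = {
    "pGRN121": ("lambda clone 25-1.1.6", "pBluescriptIISK+"),
    "pGRN124": ("pGRN121", "pGRN121"),
    "pGRN126": ("pGRN124", "pBBS235"),
    "pGRN136": ("pGRN126", "pBBS242"),
    "pGRN137": ("pGRN130", "pGRN136"),
    "pGRN145": ("pGRN137", "pBBS212"),
}


def ancestry(plasmid, parents=PARENTS, depth=0, lines=None):
    """Return an indented listing of everything a construct was built from."""
    if lines is None:
        lines = []
    lines.append("  " * depth + plasmid)
    # dict.fromkeys removes duplicate sources while keeping their order.
    for source in dict.fromkeys(parents.get(plasmid, ())):
        ancestry(source, parents, depth + 1, lines)
    return lines


print("\n".join(ancestry("pGRN145")))
```

Extending the dictionary with the other entries in this list (for example pGRN146 or pGRN152) would allow the same function to trace the mutant and FLAG-tagged constructs.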

pGRN146 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The
Sse8387I-NotI fragment from pGRN130 containing the D869A mutation of hTRT was
inserted into the Sse8387I-NotI sites of pGRN137. The selectable markers used with this
vector are Ampicillin/HygB/PAC.

pGRN147 

The Sse8387I-NotI fragment from pGRN139 containing the IBI Flag was inserted into the
Sse8387I-NotI sites of pGRN137. 

pGRN148 

The BglII-Eco47III fragment from pGRN144 containing the promoter region of hTRT was
inserted into the BglII-NruI sites of pSEAP2 to make an hTRT promoter/reporter construct.

pGRN149 

This vector is an intermediate vector for constructing an hTRT fusion protein expression
vector. The mutagenic oligo 5'-cttcaagaccatcctggactttcgaaacgcggccgccaccg
cggtggagctcc-3' was used to add a Csp45I site at the C-terminus of hTRT by in vitro
mutagenesis of pGRN125. The "stop" codon of hTRT was deleted and replaced with a
Csp45I site. The selectable marker that is used with this vector is ampicillin.

pGRN150

The BglII-FspI fragment from pGRN144 containing the promoter region of hTRT was
inserted into the BglII-NruI sites of pSEAP2 to make an hTRT promoter/reporter construct.

pGRN151

This vector was constructed for the expression of hTRT sequences in mammalian cells. The
EcoRI fragment from pGRN147 containing the hTRT coding sequence was inserted into the
EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the 5' UTR of
the hTRT mRNA. The hTRT coding sequence is oriented so that it is expressed under the
control of the MPSV promoter. The selectable markers used with this vector are
Chlor/HygB/PAC.

pGRN152

The EcoRI fragment from pGRN146 containing the hTRT coding sequence was inserted into
the EcoRI site of pBBS212 to remove the portion of the sequence corresponding to the
5' UTR of the hTRT mRNA. The hTRT coding sequence is oriented so that it is expressed
under the control of the MPSV promoter.

pGRN153 

The StyI fragment from pGRN130 containing the D869->A mutation of hTRT (hTRT
variant coding sequence) was inserted into the StyI sites of pGRN158 to make a plasmid
containing the hTRT coding sequence with a Kozak consensus sequence at its 5'-end, an IBI
FLAG sequence at its 3'-end (the C-terminus encoding region), and the D869->A mutation.

pGRN154 

The EcoRI fragment of pGRN153 containing the hTRT gene was inserted into the EcoRI site
of plasmid pBBS212 in an orientation such that the hTRT ORF is oriented in the same
direction as the MPSV promoter. This makes an MPSV-directed expression plasmid that
expresses the hTRT protein with a Kozak consensus sequence at its amino-terminal end, an
IBI FLAG at its carboxy-terminal end, and the D869->A mutation.

pGRN155 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
insert included full length cDNA of hTRT minus 5' and 3' UTR, and Kozak sequences. The 
EcoRI fragment from pGRN145 containing the hTRT cDNA with the Kozak consensus and 
no 3' or 5' UTR was inserted into the EcoRI site of p91023(B) such that the hTRT is in the
same orientation as the DHFR ORF. This makes a transient expression vector for hTRT. The 
selectable marker used with this vector is tetracycline. 

pGRN156 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The
EcoRI fragment from pGRN146 containing the D869A mutation of the hTRT cDNA with the
Kozak consensus and no 3' or 5' UTR was inserted into the EcoRI site of p91023(B) such that
the hTRT is in the same orientation as the DHFR ORF. This makes a transient expression
vector for hTRT. The insert included full length cDNA of hTRT minus 5' and 3' UTR,
D869A, and Kozak sequences. The selectable marker used with this vector is tetracycline.

pGRN157 

This vector was constructed for the expression of hTRT sequences in mammalian cells. The 
EcoRI fragment from pGRN147 containing the hTRT cDNA with the IBI FLAG at the
C-terminus, the Kozak consensus, and no 3' or 5' UTR was inserted into the EcoRI site of
p91023(B) such that the hTRT is in the same orientation as the DHFR ORF. This makes a
transient expression
vector for hTRT. The insert included full length cDNA of hTRT minus 5' and 3' UTR, the IBI 
FLAG sequence, and Kozak sequences. The selectable marker used with this vector is 
tetracycline. 

pGRN158 

This vector was constructed for the expression and mutagenesis of TRT sequences in E. coli.
The EcoRI fragment from pGRN151 containing the hTRT ORF was inserted into the EcoRI
site of pBBS183 so that the hTRT ORF is oriented in the opposite direction as the Lac
promoter. The insert included full length cDNA of hTRT minus 5' and 3' UTR, IBI FLAG
sequence, and Kozak sequences. The hTRT coding sequence is driven by a T7 promoter.
The selectable marker used with this vector is ampicillin.

pGRN159 

This vector was constructed for the expression and mutagenesis of TRT sequences in E. coli.
The NheI-KpnI fragment from pGRN138 containing the EGFP to hTRT fusion was inserted
into the XbaI-KpnI sites of pBluescriptIIKS+. This makes a T7 expression vector for the
fusion protein (the coding sequence is driven by a T7 promoter). The insert included full
length cDNA of hTRT minus the 3' UTR as a fusion protein with EGFP. The selectable
marker used with this vector is ampicillin.

pGRN160 

This vector was constructed for the expression of antisense hTR sequences in mammalian
cells. The coding sequence is operably linked to an MPSV promoter. The XhoI-NsiI
fragment from pGRN90 containing the full length hTR ORF was inserted into the
SalI-Sse8387I sites of pBBS295. This makes a transient/stable vector expressing hTR antisense
RNA. A GPT marker was incorporated into the vector. The selectable markers used with
this vector are Chlor/gpt/PAC.

pGRN161 

This vector was constructed for the expression of sense hTR sequences in mammalian cells. 
The XhoI-NsiI fragment from pGRN89 containing the full length hTR ORF was inserted into
the SalI-Sse8387I sites of pBBS295. This makes a transient/stable vector expressing hTR in 
the sense orientation. The coding sequence is driven by an MPSV promoter. A GPT marker 
was incorporated into the vector. The selectable markers used with this vector are 
Chlor/gpt/PAC. 

pGRN162 

The XhoI-NsiI fragment from pGRN87 containing the full length hTR ORF was inserted into
the SalI-Sse8387I sites of pBBS295. This makes a transient/stable vector expressing 
truncated hTR (from position +108 to +435) in the sense orientation. 

pGRN163 

This vector was constructed for the expression and mutagenesis of TRT sequences in E. coli. 
The coding sequence is driven by a T7 promoter. Oligonucleotide RA45
(5'-GCCACCCCCGCGCTGCCTCGAGCTCCCCGCTGC-3') was used in in vitro mutagenesis
to change the initiating Met in hTRT to Leu and introduce an XhoI site in the next two codons
after the Leu. In addition, COD 1941 was used to change CatR to CatS, introducing a BspHI
site, and COD 2866 was used to change AmpS to AmpR, introducing an FspI site. The
selectable marker used with this vector is ampicillin.
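
By way of illustration only, the features attributed to oligonucleotide RA45 can be checked by a simple string search. The sketch below is not part of the original procedure; the helper name `find_sites` is hypothetical, and the reading of the CTG triplet immediately upstream of the XhoI site as the new Leu codon is an assumption made for this example.

```python
# Oligonucleotide RA45 exactly as written in the text (5' to 3').
RA45 = "GCCACCCCCGCGCTGCCTCGAGCTCCCCGCTGC"

XHOI_SITE = "CTCGAG"   # XhoI recognition sequence
LEU_CODON = "CTG"      # one of the six leucine codons


def find_sites(seq, site):
    """Return the 0-based start index of every occurrence of `site` in `seq`."""
    seq, site = seq.upper(), site.upper()
    return [i for i in range(len(seq) - len(site) + 1)
            if seq[i:i + len(site)] == site]


xhoi_positions = find_sites(RA45, XHOI_SITE)
ctg_positions = find_sites(RA45, LEU_CODON)

# Assumption: the CTG that sits just 5' of the XhoI site is the codon that
# replaces the initiating ATG (Met -> Leu).
upstream_ctg = [p for p in ctg_positions if p < xhoi_positions[0]]

print("oligo length:", len(RA45))
print("XhoI site start(s):", xhoi_positions)
print("CTG upstream of XhoI site:", upstream_ctg)
```

The same helper can be pointed at any other recognition sequence to confirm that a mutagenic oligonucleotide carries the intended site before it is ordered.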

pGRN164 

This vector was constructed for the expression of hTR sequences in E. coli. Primers hTR+1
5'-GGGGAAGCTTTAATACGACTCACTATAGGGTTGCGGAGGGTGGGCCTG-3' and hTR+445
5'-CCCCGGATCCTGCGCATGTGTGAGCCGAGTCCTGGG-3' were used to amplify by PCR a fragment
from pGRN33 containing the full length hTR with the T7 promoter on the 5' end (as in
hTR+1). A BamHI-HindIII digest of the PCR product was put into the BamHI-HindIII sites of
pUC119. The coding sequence is operably linked to a T7 promoter. The selectable marker used
with this vector is ampicillin. pGRN164 is also called phTR+1.

pGRN165 

This vector was constructed for the expression and mutagenesis of hTRT sequences in E. coli.
The coding sequence is operably linked to a T7 promoter. The EcoRI fragment from
pGRN145 containing the hTRT ORF with a Kozak front end was inserted into the EcoRI site
of pBluescriptIISK+ so that the hTRT is oriented in the same direction as the T7 promoter.
The selectable marker used with this vector is ampicillin.

pGRN166 

This vector was constructed for the expression and mutagenesis of TRT sequences in
mammalian cells. The coding sequence is operably linked to a T7 promoter. The EcoRI
fragment from pGRN151 containing the hTRT ORF with a Kozak front end and IBI FLAG at
the back end was inserted into the EcoRI site of pBluescriptIISK+ so that the hTRT ORF is
oriented in the same direction as the T7 promoter. The insert included full length cDNA of
hTRT minus 5' and 3' UTR, FLAG sequence (Immunex Corp, Seattle WA), and Kozak
sequences. The selectable marker used with this vector is ampicillin.

pGRN167 

The AvrII-StuI fragment from pGRN144 containing the 5' end of the hTRT ORF was inserted
into the XbaI-StuI sites of pBBS161.

pGRN168 

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was
inserted into the EcoRI site of pIND such that the hTRT coding sequence is in the same
orientation as the miniCMV promoter.

pGRN169

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was 
inserted into the EcoRI site of pIND such that the hTRT is in the reverse orientation from the 
miniCMV promoter. 


pGRN170 

The EcoRI fragment from pGRN145 containing the optimized hTRT expression cassette was 
inserted into the EcoRI site of pIND(sp1) such that the hTRT is in the opposite orientation
from the miniCMV promoter.

pGRN171 

The Eco47III-NarI fragment from pGRN163 was inserted into the Eco47III-NarI sites of 
pGRN167, putting the M1L mutation into a fragment of the hTRT genomic DNA.

pGRN172

The BamHI-StuI fragment from pGRN171 containing the Met to Leu mutation in the hTRT
ORF was inserted into the BglII-NruI sites of pSEAP2-Basic.

pGRN173 

The EcoRV-Eco47III fragment from pGRN144 containing the 5' end of the hTRT
promoter region was inserted into the SrfI-Eco47III sites of pGRN172. This makes a
promoter reporter plasmid that contains the promoter region of hTRT from approximately 2.3
kb upstream from the start of the hTRT ORF to just after the first intron in the coding region,
with the Met1->Leu mutation.

pGRN174 

The EcoRI fragment from pGRN145 containing the "optimized" hTRT expression cassette 
was inserted into the EcoRI site of pIND(sp1) such that the hTRT is in the same orientation
as the miniCMV promoter. 

EXAMPLE 7
RECONSTITUTION OF TELOMERASE ACTIVITY
A. Co-Expression of hTRT and hTR in vitro 

In this example, the coexpression of hTRT and hTR using an in vitro cell-free
expression system is described. These results demonstrate that the hTRT polypeptide
encoded by pGRN121 is a catalytically active telomerase protein and that in vitro
reconstitution (IVR) of the telomerase RNP can be accomplished using recombinantly
expressed hTRT and hTR.

Telomerase activity was reconstituted by adding linearized plasmids of hTRT
(pGRN121; 1 µg DNA digested with XbaI) and hTR (phTR+1; 1 µg digested with FspI) to a
coupled transcription-translation reticulocyte lysate system (Promega TNT™). phTR+1 is a
plasmid which, when linearized with FspI and then transcribed by T7 RNA polymerase,
generates a 445 nucleotide transcript beginning with nucleotide +1 and extending to
nucleotide 446 of hTR (Autexier et al., 1996, EMBO J. 15:5928). For a 50 µl reaction the
following components were added: 2 µl TNT™ buffer, 1 µl TNT™ T7 RNA polymerase,
1 µl 1 mM amino acid mixture, 40 units Rnasin™ RNase inhibitor, 1 µg each linearized
template DNA, and 25 µl TNT™ reticulocyte lysate. Components were added in the ratio
recommended by the manufacturer and were incubated for 90 min at 30 °C. Transcription was
under the direction of the T7 promoter and could also be carried out prior to the addition of
reticulocyte lysate with similar results. After incubation, 5 and 10 µl of the programmed
transcription-translation reaction were assayed for telomerase activity by TRAP as
previously described (Autexier et al., supra) using 20 cycles of PCR to amplify the signal.

The results of the reconstitution are shown in Figure 10. For each
transcription/translation reaction assayed there are 3 lanes: the first 2 lanes are duplicate
assays and the third lane is a duplicate sample heat denatured (95 °C, 5 min) prior to the
TRAP phase to rule out PCR generated artifacts.

As shown in Figure 10, reticulocyte lysate alone has no detectable telomerase
activity (lane 6). Similarly, no detectable activity is observed when either hTR alone (lane 1)
or full length hTRT gene (lane 4) is added to the lysate. When both components are added
(lane 2), telomerase activity is generated as demonstrated by the characteristic repeat ladder
pattern. When the carboxyl-terminal region of the hTRT gene is removed by digestion of the
vector with NcoI ("truncated hTRT"), telomerase activity is abolished (lane 3). Lane 5 shows
that translation of the truncated hTRT alone does not generate telomerase activity. Lane "R8"
shows a positive control for a telomerase product ladder generated by TRAP of TSR8, a
synthetic telomerase product having a nucleotide sequence of
5'-ATTCCGTCGAGCAGAGTTAG[GGTTAG]7-3'.
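
For readers unfamiliar with the bracketed repeat notation used for TSR8, the following minimal sketch expands it into the full sequence. The function name is hypothetical, and the sequence is taken exactly as it is written in the text above, with the bracketed unit repeated seven times.

```python
def expand_repeat_notation(prefix, unit, count):
    """Expand 'prefix[unit]count' notation into a single sequence string."""
    return prefix + unit * count


# TSR8 as written above: a 20 nt leading segment followed by seven GGTTAG units.
TSR8 = expand_repeat_notation("ATTCCGTCGAGCAGAGTTAG", "GGTTAG", 7)

print(TSR8)
print("length:", len(TSR8), "nt")
```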

It was also observed that purification of IVR telomerase resulted in a stronger
signal and/or reduced background in certain telomerase activity assays. In some experiments,
IVR telomerase activity from co-synthesized components was enriched by fractionation of
TnT reactions over DEAE anion exchange membranes (Millipore Ultrafree-MC): 200 µl of
the hTRT/hTR TnT reaction was passed through a single DEAE membrane. The membrane
was washed with 400 µl of 0.2 M NaCl in buffer A (20 mM HEPES-KOH pH 7.9, 2 mM
MgCl2, 1 mM EGTA, 10% glycerol, 0.1% Nonidet P-40, 0.1 mM phenylmethylsulfonyl
fluoride) and IVR telomerase was eluted from the membrane with 80 µl of 1 M NaCl in
buffer A. Alternatively, batch chromatography was used: 400 µl of the TnT reaction was
partially purified by batch chromatography using 25 µl of Toso-Haas Q-650M resin. After
telomerase was bound, the resin was washed with 0.1 M NaCl in buffer A, followed by a
second wash with 0.18 M NaCl in buffer A, and telomerase was eluted with 100 µl of 0.3 M
NaCl in buffer A.

B. Mixing of hTRT and hTR in vitro 

In vitro reconstitution of telomerase activity was also accomplished by mixing.
hTRT was transcribed and translated as described supra, but without the addition of the hTR
plasmid. Reconstitution of the telomerase RNP was then accomplished by mixing the hTRT
translation mixture with hTR (previously generated by T7 RNA polymerase transcription
from phTR+1-Fsp) in the ratio of 2 µl of hTRT translation mix to 2 µl of hTR (1 µg), and then
incubating for 90 minutes at 30 °C. The reaction conditions were adjusted to a KCl
concentration of about 0.2 M. (The presence of KCl at a concentration of about 0.1 M to
about 1.0 M may enhance telomerase activity or telomerase reconstitution in IVR.) This
method of hTRT/hTR reconstitution is referred to as "linked reconstitution" or "linked IVR."
Telomerase activity is present (i.e., can be detected) in this mixture. Improved signal was
observed following partial purification of the activity by DEAE chromatography. In this case,
Millipore Ultrafree-MC DEAE Centrifugal Filter Devices were used according to the
manufacturer's directions. The buffers used were hypo0.1, hypo0.2, and hypo1.0, where
hypo is 20 mM Hepes-KOH, pH 7.9, 2 mM MgCl2, 1 mM EGTA, 10% glycerol, 0.1%
NP-40, 1 mM DTT, 1 mM Na-metabisulfite, 1 mM benzamidine, and 0.2 mM
phenylmethylsulfonyl fluoride (PMSF), and where 0.1, 0.2 and 1.0 refer to 0.1, 0.2 or 1.0 M
KCl. The filters were pre-conditioned with hypo1.0 and washed with hypo0.1, the reconstituted
telomerase was loaded, the column was washed with hypo0.1 then hypo0.2, and the
reconstituted telomerase was eluted with hypo1.0 in half the volume that was loaded. This
formulation could be stored frozen at -70 °C and retains activity.

Telomerase activity was assayed in a two step procedure. In step one, a
conventional telomerase assay was performed as described in Morin, 1989, Cell 59:521,
except no radiolabel was used. In step two, an aliquot was assayed by the TRAP procedure
for 20-30 cycles as described supra. The conventional assay was performed by assaying 1-10
µl of reconstituted telomerase in 40-50 µl final volume of 25 mM Tris-HCl, pH 8.3, 50 mM
K-acetate, 1 mM EGTA, 1 mM MgCl2, 2 mM dATP, 2 mM TTP, 10 µM dGTP, and 1 µM
primer (usually M2, 5'-AATCCGTCGAGCAGAGTT) at 30 °C for 60-180 minutes. The
reaction was stopped by heating to 95 °C for 5 minutes and 1-10 µl of the first step mixture
was carried onto the step two TRAP reaction (50 µl).

In additional experiments, the synthesis of hTRT and hTR during in vitro
reconstitution was monitored by 35S-methionine incorporation and Northern blotting,
respectively. Proteins of approximately the predicted size were synthesized for hTRT (127
kD), hTRT-Nco (85 kD), and pro90hTRT (90 kD) in approximately equal molar amounts
relative to each other. The Northern analysis indicated that the hTR synthesized was the
correct size (445 nucleotides) and predominantly intact.

High levels of reconstitution and telomerase activity were also obtained with 2
µg of linearized pGRN121 in a 50 µl TnT reaction as described supra (Example 7A) except
that in place of the hTR template, 4 pmol (0.6 µg) of hTR RNA (previously generated by T7
RNA polymerase transcription from phTR+1-Fsp) was added at the beginning of the TnT
reaction and the reaction was incubated at 30 °C for 90-120 minutes. Slightly greater (2-5
times) activity was achieved using 1 µg of supercoiled XhTRT-E and 16 pmol (2.4 µg) of
pre-synthesized hTR RNA set up in a 50 µl TnT reaction, as described supra, with incubation
at 30 °C for 90-120 minutes. XhTRT-E is an hTRT construct in the pcDNA3.1/His Xpress
vector (Invitrogen) in which an optimized ribosome recognition site (Kozak consensus), six
histidine residues, and an epitope tag are fused with the hTRT open reading frame.
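
As an illustrative check (not part of the original protocol), the molar amounts of pre-synthesized hTR RNA can be related to mass using the 445 nucleotide transcript length. The sketch assumes an average mass of about 340 g/mol per ribonucleotide, which is a common rule-of-thumb value and not a figure taken from this document; the function name is likewise hypothetical. Under that assumption, 4 pmol and 16 pmol of a 445 nucleotide RNA correspond to roughly 0.6 and 2.4 micrograms, respectively.

```python
# Assumed average mass of one RNA nucleotide (g/mol); a rough approximation.
AVG_RNA_NT_MASS = 340.0


def rna_pmol_to_ug(pmol, length_nt, nt_mass=AVG_RNA_NT_MASS):
    """Convert picomoles of a single-stranded RNA to micrograms.

    pmol * (g/mol) gives picograms; dividing by 1e6 converts pg to micrograms.
    """
    grams_per_mol = length_nt * nt_mass
    return pmol * grams_per_mol / 1e6


HTR_LENGTH_NT = 445  # transcript length stated in the text

for pmol in (4, 16):
    mass_ug = rna_pmol_to_ug(pmol, HTR_LENGTH_NT)
    print(f"{pmol} pmol of a {HTR_LENGTH_NT} nt RNA is about {mass_ug:.2f} micrograms")
```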

Variations of the reconstitution protocols, supra, will be apparent to those of
skill. For example, the time and temperature of reconstitution, and presence or concentration
of components such as monovalent salt (e.g., NaCl, KCl, potassium acetate, potassium
glutamate, and the like), divalent salt (MgCl2, MnCl2, MgSO4, and the like), denaturants
(urea, formamide, and the like), and detergents (NP-40, Tween, CHAPS, and the like) can be
varied, and alternative improved purification procedures (such as immunoprecipitation,
affinity or standard chromatography) can be employed. These and other parameters can be
varied in a systematic way to optimize conditions for particular assays or other
reconstitution protocols.

C. Reconstitution Using hTRT Variants and Fusion Proteins 

Reconstitution of telomerase catalytic activity occurred when EGFP-hTRT, a
fusion of the enhanced green fluorescent protein to hTRT (see Examples 6 and 15), or
epitope-tagged hTRT (IBI FLAG, see Example 6) was coexpressed with hTR; both
reconstituted telomerase activity at approximately wild-type levels.

In contrast, telomerase activity was not reconstituted when a variant hTRT,
pro90hTRT (missing RT motifs B', C, D, and E) was used. This demonstrates that
pro90hTRT does not possess full telomerase catalytic activity, although it may have other
partial activities (e.g., RNA [i.e., hTR] binding ability and function as dominant-negative
regulator of telomerase in vivo as described supra).

D. Assay of in vitro Reconstituted Telomerase Activity Using the Gel Blot and 
Conventional Telomerase Assay 

The following example demonstrates that in vitro reconstituted (IVR)
telomerase can be assayed using conventional telomerase assays in addition to
amplification-based assays (i.e., TRAP). IVR telomerase prepared as described in part (B),
supra (the "linked reconstitution method"), followed by DEAE purification as described
supra, was assayed using the gel blot assay under the following reaction conditions: 1-10 µl
of linked IVR telomerase in 40 µl final volume of 25 mM Tris-HCl, pH 8.3, 50 mM K-acetate,
1 mM EGTA, 1 mM MgCl2, 0.8 mM dATP, 0.8 mM TTP, 1.0 mM dGTP, and 1 µM primer (M2,
supra; or H3.03, 5'-TTAGGGTTAGGGTTAGGG) at 30 °C for 180 minutes. The telomeric
DNA synthesized was isolated by standard procedures, separated on an 8% polyacrylamide, 8
M urea gel, transferred to a nylon membrane, and probed using the 32P-(CCCTAA)n riboprobe
used in the dot-blot assay. The probe identified a six nucleotide ladder in the lane
representing 10 µl of IVR telomerase that was equivalent to the ladder observed for 5 µl of
native nuclear telomerase purified by mono Q and heparin chromatography. The results show
that IVR telomerase possesses processive telomerase catalytic activity equivalent to native
telomerase.

Linked IVR telomerase was also assayed by the conventional 32P-dGTP
incorporation telomerase assay. IVR telomerase prepared by the linked reconstitution method
followed by DEAE purification, as described above, was assayed under both processive and
non-processive reaction conditions. Assay conditions were 5-10 µl of linked IVR telomerase
in 40 µl final volume of 25 mM Tris-HCl, pH 8.3, 50 mM K-acetate, 1 mM EGTA, 1 mM
MgCl2, 2 mM dATP, 2 mM TTP, with 10 µM 32P-dGTP (72 Ci/mmol) [for assay of
processive conditions] or 1 µM 32P-dGTP (720 Ci/mmol) [for non-processive], and 1 µM
primer (i.e., H3.03, supra) at 30 °C [for the processive reaction] or 37 °C [for the
non-processive reaction] for 180 minutes. The telomeric DNA synthesized was isolated by
standard procedures and separated on an 8% polyacrylamide, 8 M urea sequencing gel.
The processive reaction showed a weak six nucleotide ladder consistent with a processive
telomerase reaction, and the non-processive reaction added one repeat, a pattern equivalent to
a control reaction with a native telomerase preparation. Conventional assays using IVR
telomerase are useful in screens for telomerase modulators, as described herein, as well as
other uses such as elucidation of the structural and functional properties of telomerase.
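
The two labeling conditions above differ tenfold in dGTP concentration and tenfold, in the opposite direction, in specific activity. A short calculation, offered only as a sketch with a hypothetical function name, shows that in a 40 µl reaction both conditions therefore contain the same total amount of radiolabel, so that the conditions differ in dGTP concentration rather than in total label.

```python
def total_label_uCi(conc_uM, volume_ul, specific_activity_Ci_per_mmol):
    """Total radioactivity (microcuries) of a labeled nucleotide in a reaction.

    uM * ul = pmol; pmol * 1e-9 = mmol; mmol * Ci/mmol = Ci; Ci * 1e6 = uCi.
    """
    pmol = conc_uM * volume_ul
    mmol = pmol * 1e-9
    return mmol * specific_activity_Ci_per_mmol * 1e6


REACTION_VOLUME_UL = 40  # final volume stated in the text

processive = total_label_uCi(10, REACTION_VOLUME_UL, 72)      # 10 uM, 72 Ci/mmol
non_processive = total_label_uCi(1, REACTION_VOLUME_UL, 720)  # 1 uM, 720 Ci/mmol

print(f"processive conditions:     {processive:.1f} uCi")
print(f"non-processive conditions: {non_processive:.1f} uCi")
```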

E. In vitro Reconstituted Telomerase Recognizes Primer 3' Termini

This experiment demonstrates that IVR telomerase recognizes primer 3'
termini equivalently to native (purified) telomerase. Telomerase forms a base-paired duplex
between the primer 3' end and the template region of hTR and adds the next specified
nucleotide (Morin, 1989, supra). To verify that IVR (recombinant) telomerase has the same
property, the reactions of primers with -GGG or -TAG 3' termini
(AATCCGTCGAGCAGAGGG and AATCCGTCGAGCAGATAG) were compared to a
primer having a -GTT 3' terminus (M2, supra) using IVR and native telomerase assayed by
the two step conventional/TRAP assay detailed above. The product ladders of the -GGG
and -TAG primers were shifted +4 and +2, respectively, when compared to the standard
primer (-GTT 3' end), the same effect as was observed with native telomerase. This
experiment demonstrates that IVR and native telomerases recognize primer termini in a
similar manner.

These results (along with the results supra showing that IVR telomerase 
possesses both processive and non-processive catalytic activity) indicate that IVR telomerase 
has similar structure and properties compared to native or purified telomerase. 



EXAMPLE 8 
PRODUCTION OF ANTI-hTRT ANTIBODIES 
A. Production of Anti-hTRT Antibodies Against hTRT Peptides 

To produce anti-hTRT antibodies, the following peptides from hTRT were
synthesized with the addition of C (cysteine) as the amino terminal residue (see Figure 54).
S-1: FFY VTE TTF QKN RLF FYR KSV WSK
S-2: RQH LKR VQL RDV SEA EVR QHR EA
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV
The cysteine moiety was used to immobilize (i.e., covalently link) the peptides to BSA and
KLH [keyhole limpet hemocyanin] carrier proteins. The KLH-peptides were used as antigen.
The BSA-peptide conjugates served as material for ELISAs for testing the specificity of
immune antisera.

The KLH-peptide conjugates were injected into New Zealand White rabbits.
The initial injections were made by placing the injectant proximal to the axillary and inguinal
lymph nodes. Subsequent injections were made intramuscularly. For initial injections, the
antigen was emulsified with Freund's complete adjuvant; for subsequent injections, Freund's
incomplete adjuvant was used. Rabbits followed a three week boost cycle, in which 50 ml of
blood yielding 20-25 ml of serum was taken 10 days after each boost. Antisera against each of
the four peptides recognized the hTRT moiety of recombinant hTRT fusion protein
(GST-HIS8-hTRT fragment 2426 to 3274; see Example 6) on western blots.

Using a partially purified telomerase fraction from human 293 cells
(approximately 1000-fold purification compared to a crude nuclear extract) that was produced
as described in PCT application No. 97/06012 and affinity purified anti-S-2 antibodies, a 130
kD protein doublet could be detected on a western blot. A sensitive chemiluminescence
detection method was employed (SuperSignal chemiluminescence substrates, Pierce) but the
signal on the blot was weak, suggesting that hTRT is present in low or very low abundance in
these immortal cells. The observation of a doublet is consistent with a post-translational
modification of hTRT, i.e., phosphorylation or glycosylation.



For affinity purification, the S-2 peptide was immobilized to SulfoLink
(Pierce, Rockford IL) through its N-terminal cysteine residue according to the manufacturer's
protocol. First bleed serum from a rabbit immunized with the KLH-S-2 peptide antigen was
loaded over the S-2-SulfoLink and antibodies specifically bound to the S-2 peptide were
eluted.

B. Production of Anti-hTRT Antibodies Against hTRT Fusion Proteins

GST-hTRT fusion proteins were expressed in E. coli as the GST-hTRT
fragment #4 (nucleotides 3272-4177) and the GST-HIS8-hTRT fragment #3 (nucleotides
2426 to 3274) proteins described in Example 6. The fusion proteins were purified as
insoluble protein, and the purity of the antigens was assayed by SDS polyacrylamide gels and
estimated to be about 75% pure for the GST-hTRT fragment #4 recombinant protein and
more than 75% pure for GST-HIS8-hTRT fragment #3 recombinant protein. Routine
methods may be used to obtain these and other fusion proteins at a purity of greater than
90%. These recombinant proteins were used to immunize both rabbits and mice, as described
above.

The first and second bleeds from both the mice and rabbits were tested for the
presence of anti-hTRT antibodies after removal of anti-GST antibodies using a matrix
containing immobilized GST. The antisera were tested for anti-hTRT antibodies by Western
blotting using immobilized recombinant GST-hTRT fusion protein, and by
immunoprecipitation using partially purified native telomerase enzyme. While no signal was
observed in these early bleeds, titers of anti-hTRT antibodies, as expected, increased in
subsequent bleeds.

EXAMPLE 9

DETECTION OF AN hTRT mRNA CORRESPONDING TO Δ182 RNA VARIANT

Poly A+ RNA from human testis and the 293 cell line was analyzed for hTRT
mRNA using RT-PCR and nested primers. The first primer set was TCP1.1 and TCP1.15;
the second primer set was TCP1.14 and BTCP6. Amplification from each gave two products
differing by 182 bp; the larger and smaller products from testis RNA were sequenced and
found to correspond exactly to pGRN121 (Figure 16) and the 712562 clone (Figure 18),
respectively. The variant hTRT RNA product has been observed in mRNA from SW39i,
OVCAR4, 293, and Testes.

Additional experiments were carried out to demonstrate that the Δ182 cDNA
was not an artifact of reverse transcription. Briefly, full-length hTRT RNA (i.e., without the
deletion) was produced by in vitro transcription of pGRN121 for use as a template for
RT-PCR. Separate cDNA synthesis reactions were carried out using Superscript® reverse
transcriptase (Bethesda Research Laboratories, Bethesda MD) at 42 °C or 50 °C, and with
random primers or a specific primer. After 15 PCR cycles the longer product was detectable;
however, the smaller product (i.e., corresponding to the deletion) was not detectable even
after 30 or more cycles. This indicates that the RT-PCR product is not an artifact.

EXAMPLE 10

SEQUENCING OF TESTIS hTRT mRNA

The sequence of the testis form of hTRT RNA was determined by direct
manual sequencing of DNA fragments generated by PCR from testis cDNA (Marathon Testes
cDNA, Clontech, San Diego CA) using a ThermoSequenase radiolabeled terminator cycle
sequencing kit (Amersham Life Science). The PCR step was performed by a nested PCR, as
shown in Table 8. In all cases a negative control reaction with primers but no cDNA was
performed. The absence of product in the control reaction demonstrated that the products
derived from the reaction with cDNA present were not due to contamination of hTRT from
pGRN121 or other cell sources (e.g., 293 cells). The DNA fragments were excised from
agarose gels to purify the DNA prior to sequencing.

The testis mRNA sequence corresponding to bases 27 to 3553 of the
pGRN121 insert sequence, and containing the entire hTRT ORF (bases 56 to 3451), was
obtained. There were no differences between the testis and the pGRN121 sequences in this
region.

EXAMPLE 11

DETECTION OF hTRT mRNA BY RNASE PROTECTION

RNase protection assays can be used to detect, monitor, or diagnose the 
presence of an hTRT mRNA or variant mRNA. One illustrative RNAse protection probe is an 
in vitro synthesized RNA comprised of sequences complementary to hTRT mRNA sequences 
and additional, non-complementary sequences. The latter sequences are included to 
distinguish the full-length probe from the fragment of the probe that results from a positive 
result in the assay: in a positive assay, the complementary sequences of the probe are 
protected from RNase digestion, because they are hybridized to hTRT mRNA. The 
non-complementary sequences are digested away from the probe in the presence of RNase 
and target complementary nucleic acid. 

Two RNase protection probes are described for illustrative purposes; either
can be used in the assay. The probes differ in their sequences complementary to hTRT, but
contain identical non-complementary sequences, in this embodiment, derived from the SV40
late mRNA leader sequence. From 5'-3', one probe is comprised of 33 nucleotides of
non-complementary sequence and 194 nucleotides of sequence complementary to hTRT
nucleotides 2513-2707 for a full length probe size of 227 nucleotides. From 5'-3', the second
probe is comprised of 33 nucleotides of non-complementary sequence and 198 nucleotides of
sequence complementary to hTRT nucleotides 2837-3035 for a full length probe size of 231
nucleotides. To conduct the assay, either probe can be hybridized to RNA, i.e., polyA+ RNA,
from a test sample, and T1 ribonuclease and RNase A are then added. After digestion, probe
RNA is purified and analyzed by gel electrophoresis. Detection of a 194 nucleotide fragment
of the 227 nucleotide probe or a 198 nucleotide fragment of the 231 nucleotide probe is
indicative of hTRT mRNA in the sample.
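
The size logic of the assay can be summarized in a short sketch. The class, the function, the probe labels, and the band-size tolerance are all hypothetical names and values introduced for this illustration; only the nucleotide counts come from the description above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProtectionProbe:
    name: str
    leader_nt: int          # non-complementary (SV40-derived) portion
    complementary_nt: int   # portion complementary to hTRT mRNA

    @property
    def full_length(self):
        return self.leader_nt + self.complementary_nt

    @property
    def protected_fragment(self):
        # Only the hybridized, complementary portion survives RNase digestion.
        return self.complementary_nt


PROBES = (
    ProtectionProbe("probe 1", leader_nt=33, complementary_nt=194),
    ProtectionProbe("probe 2", leader_nt=33, complementary_nt=198),
)


def interpret_band(probe, observed_nt, tolerance_nt=2):
    """Classify a gel band; tolerance_nt is an assumed sizing error, not a source value."""
    if abs(observed_nt - probe.protected_fragment) <= tolerance_nt:
        return "protected fragment: indicative of hTRT mRNA"
    if abs(observed_nt - probe.full_length) <= tolerance_nt:
        return "full-length probe: undigested probe"
    return "unassigned band"


for probe in PROBES:
    print(probe.name, probe.full_length, probe.protected_fragment)
```

Because the protected fragment is 33 nucleotides shorter than the intact probe, the two species are readily distinguished by gel electrophoresis.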

The illustrative RNAse protection probes described in this example can be 
generated by in vitro transcription using T7 RNA polymerase. Radioactive or otherwise 
labeled ribonucleotides can be included for synthesis of labeled probes. The templates for the 
in vitro transcription reaction to produce the RNA probes are PCR products. These 
illustrative probes can be synthesized using T7 polymerase following PCR amplification of 
pGRN121 DNA using primers that span the corresponding complementary region of the 
hTRT gene or mRNA. In addition, the downstream primer contains T7 RNA polymerase
promoter sequences and the non-complementary sequences.

For generation of the first RNAse protection probe, the PCR product from the
following primer pair (T701 and reverse01) is used:

T701 5'-GGG AGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG
CTGCGCCGGC TGTCA GGCTCCC ACGACGTAGT CCATGTTCAC-3'; and reverse01
5'-GGGTCTAGAT CCGGAAGAGTGT CTGGAGCAAG-3'.

For generation of the second RNase protection probe, the PCR product from 
the following primer pair (T702 and reverse02) is used: 

T702 5'-GGG AGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG
CTGCGCCGGC TGTCA GGGCG GCCTTCTGGA CCACGGCATA CC-3'; and reverse02
5'-G GTCTAGA CGATATCC ACAGGGCCTG GCGC-3'.
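
As a rough consistency check on the primer listing above, the sketch below locates the T7 promoter in the T701 and T702 primers and measures the sequence the two primers share downstream of the transcription start. It assumes the standard T7 promoter core TAATACGACTCACTATAG with transcription beginning at its final G; the helper names are hypothetical. Finding a 33-nucleotide shared stretch agrees with the 33 nucleotides of non-complementary sequence described for both probes, but it is only suggestive, since the hTRT-complementary regions could share their first bases by chance.

```python
# T7 promoter core; transcription is assumed to start at the final G.
T7_PROMOTER = "TAATACGACTCACTATAG"

# Primer sequences as listed in the text (spaces are layout only).
T701_TEXT = ("GGG AGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG "
             "CTGCGCCGGC TGTCA GGCTCCC ACGACGTAGT CCATGTTCAC")
T702_TEXT = ("GGG AGATCT TAATACGACTCACTATAG ATTCA GGCCATGGTG "
             "CTGCGCCGGC TGTCA GGGCG GCCTTCTGGA CCACGGCATA CC")


def clean(text: str) -> str:
    """Remove layout spaces from a printed sequence."""
    return text.replace(" ", "")


def transcribed_part(primer: str) -> str:
    """Return the primer from the assumed transcription start (+1 G) onward."""
    start = primer.find(T7_PROMOTER)
    if start < 0:
        raise ValueError("T7 promoter not found in primer")
    return primer[start + len(T7_PROMOTER) - 1:]


def common_prefix(a: str, b: str) -> str:
    """Longest shared prefix of two strings."""
    n = 0
    for x, y in zip(a, b):
        if x != y:
            break
        n += 1
    return a[:n]


shared_leader = common_prefix(transcribed_part(clean(T701_TEXT)),
                              transcribed_part(clean(T702_TEXT)))
print(len(shared_leader), shared_leader)
```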

EXAMPLE 12 

CONSTRUCTION OF A PHYLOGENETIC TREE COMPARING hTRT AND
OTHER REVERSE TRANSCRIPTASES 

A phylogenetic tree (Figure 6) was constructed by comparison of the seven RT 
domains defined by Xiong and Eickbush (1990, EMBO J. 9:3353). After sequence alignment 
of motifs 1, 2, and A-E from 4 TRTs, 67 RTs, and 3 RNA polymerases, the tree was 
constructed using the NJ (Neighbor Joining) method (Saitou and Nei, 1987, Mol. Biol. Evol. 
4:406). Elements from the same class that are located on the same branch of the tree are 
simplified as a box. The length of each box corresponds to the most divergent element within 
that box. 
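
For readers unfamiliar with the Neighbor Joining method cited above, the following is a minimal Python sketch of the algorithm on a small distance matrix. It is a toy illustration under stated assumptions, not the PHYLIP implementation used for Figure 6: the function names, the node labels, and the five-taxon matrix are all hypothetical and unrelated to the RT alignment.

```python
from itertools import combinations


def distances_from_matrix(names, rows):
    """Convert a square distance matrix into a {frozenset({a, b}): d} lookup."""
    return {frozenset((names[i], names[j])): rows[i][j]
            for i, j in combinations(range(len(names)), 2)}


def neighbor_joining(names, dist):
    """Return (joins, final_edge) for an unrooted neighbor-joining tree.

    Each join is (child_a, branch_a, child_b, branch_b, new_node).
    final_edge is (node_x, node_y, length) connecting the last two nodes.
    """
    d = dict(dist)
    nodes = list(names)
    joins = []
    while len(nodes) > 2:
        n = len(nodes)
        # Sum of distances from each node to all other current nodes.
        total = {a: sum(d[frozenset((a, b))] for b in nodes if b != a)
                 for a in nodes}

        def q(pair):
            a, b = pair
            return (n - 2) * d[frozenset(pair)] - total[a] - total[b]

        # Join the pair that minimizes the Q criterion.
        a, b = min(combinations(nodes, 2), key=q)
        d_ab = d[frozenset((a, b))]
        len_a = 0.5 * d_ab + (total[a] - total[b]) / (2 * (n - 2))
        len_b = d_ab - len_a
        new = f"node{len(joins) + 1}"
        joins.append((a, len_a, b, len_b, new))
        # Distances from the new internal node to every remaining node.
        for c in nodes:
            if c not in (a, b):
                d[frozenset((new, c))] = 0.5 * (
                    d[frozenset((a, c))] + d[frozenset((b, c))] - d_ab)
        nodes = [c for c in nodes if c not in (a, b)] + [new]
    x, y = nodes
    return joins, (x, y, d[frozenset((x, y))])


# Hypothetical five-taxon example.
names = ["a", "b", "c", "d", "e"]
rows = [
    [0, 5, 9, 9, 8],
    [5, 0, 10, 10, 9],
    [9, 10, 0, 8, 7],
    [9, 10, 8, 0, 3],
    [8, 9, 7, 3, 0],
]
joins, final_edge = neighbor_joining(names, distances_from_matrix(names, rows))
for join in joins:
    print(join)
print(final_edge)
```

In the analysis described here, the input distances would be derived from the aligned motifs rather than supplied by hand.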

The TRTs appear to be more closely related to RTs associated with msDNA, 
group II introns, and non-LTR (Long Terminal Repeat) retrotransposons than to the LTR- 
retrotransposon and viral RTs. The relationship of the telomerase RTs to the non-LTR 
branch of retroelements is intriguing, given that these latter elements have replaced 
telomerase for telomere maintenance in Drosophila. However, the most striking finding is 
that the TRTs form a discrete subgroup, almost as closely related to the RNA-dependent 
RNA polymerases of plus-stranded RNA viruses such as poliovirus as to any of the 
previously known RTs. Considering that the four telomerase genes come from evolutionarily 
distant organisms (protozoan, fungi, and mammal), this separate grouping cannot be
explained by lack of phylogenetic diversity in the data set. Instead, this deep bifurcation
suggests that the telomerase RTs are an ancient group, perhaps originating with the first 
eukaryote. 

GenBank protein identification or accession numbers used in the phylogenetic
analysis were: msDNAs (94535, 134069, 134074, 134075, 134078), group II introns (483039,
101880, 1332208, 1334433, 1334435, 133345, 1353081), mitochondrial plasmid/RTL
(903835, 134084), non-LTR retrotransposons (140023, 84806, 103221, 103353, 134083,
435415, 103015, 1335673, 85020, 141475, 106903, 130402, U0551, 903695, 940390,
2055276, L08889), LTR retrotransposons (74599, 85105, 130582, 99712, 83589, 84126,
479443, 224319, 130398, 130583, 1335652, 173088, 226407, 101042, 1078824),
hepadnaviruses (118876, 1706510, 118894), caulimoviruses (331554, 130600, 130593,
93553), retroviruses (130601, 325465, 74601, 130587, 130671, 130607, 130629,
130589, 130631, 1346746, 130651, 130635, 1780973, 130646). Alignment was analyzed
using ClustalW 1.5 [J. D. Thompson, D. G. Higgins, T. J. Gibson, Nucleic Acids Res. 22,
4673 (1994)] and PHYLIP 3.5 [J. Felsenstein, Cladistics 5, 164 (1989)].

EXAMPLE 13 

TRANSFECTION OF CULTURED HUMAN FIBROBLASTS (BJ) WITH CONTROL
PLASMID AND PLASMID ENCODING hTRT

This example demonstrates that expression of recombinant hTRT protein in a
mammalian cell results in the generation of an active telomerase.

Subconfluent BJ fibroblasts were trypsinized and resuspended in fresh 
medium (DMEM/199 containing 10% Fetal Calf Serum) at a concentration of 4 x 10 s 
cells/ml. The cells were transfected using electroporation with the BioRad Gene Pulser™
electroporator. Optionally, one may also transfect cells using Superfect™ reagent (Qiagen) in
accordance with the manufacturer's instructions. For electroporation, 500 µl of the cell
suspension were placed in an electroporation cuvette (BioRad, 0.4 cm electrode gap). Plasmid
DNA (2 µg) was added to the cuvettes and the suspension was gently mixed and incubated on
ice for 5 minutes. The control plasmid (pBBS212) contained no insert behind the MPSV
promoter and the experimental plasmid (pGRN133) expressed hTRT from the MPSV
promoter. The cells were electroporated at 300 Volts and 960 µFD. After the pulse was
delivered, the cuvettes were placed on ice for approximately 5 minutes prior to plating on 100
mm tissue culture dishes in medium. After 6 hours, the medium was replaced with fresh
medium. 72 hours after the transfection, the cells were trypsinized, washed once with PBS,
pelleted and stored frozen at -80°C. Cell extracts were prepared at a concentration of 25,000
cells/µl by a modified detergent lysis method (see Bodnar et al., 1996, Exp. Cell Res. 228:58;
Kim et al., 1994, Science 266:2011, and as described in patents and publications relating to
the TRAP assay, supra) and telomerase activity in the cell extracts was determined using a
modified PCR-based TRAP assay (Kim et al., 1994, Bodnar et al., 1996). Briefly, 5 x 10^4 cell
equivalents were used in the telomerase primer extension portion of the reaction. While the
extract is typically taken directly from the telomerase extension reaction to the PCR
amplification, one may also extract once with phenol/chloroform and once with chloroform
prior to the PCR amplification. One-fifth of the material was used in the PCR amplification
portion of the TRAP reaction (approximately 10,000 cell equivalents). One half of the TRAP
reaction was loaded onto the gel for analysis, such that each lane in Figure 25 represents
reaction products from 5,000 cell equivalents. Extracts from cells transfected with
pGRN133 were positive for telomerase activity while extracts from untransfected (not
shown) or control plasmid transfected cells showed no telomerase activity. Similar
experiments using RPE cells gave the same result.
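
The cell-equivalent bookkeeping in the paragraph above can be traced with a short calculation. The sketch below uses only the figures stated in the text; the variable and function names are illustrative.

```python
from fractions import Fraction

# Figures stated in the text.
EXTRACT_CELLS_PER_UL = 25_000        # extract concentration
EXTENSION_INPUT_CELLS = 50_000       # 5 x 10^4 cell equivalents in primer extension
FRACTION_TO_PCR = Fraction(1, 5)     # one-fifth carried into PCR amplification
FRACTION_LOADED = Fraction(1, 2)     # one half of the TRAP reaction loaded per lane


def extract_volume_ul(cells: int, cells_per_ul: int) -> Fraction:
    """Volume of extract that contains the requested number of cell equivalents."""
    return Fraction(cells, cells_per_ul)


pcr_cell_equivalents = EXTENSION_INPUT_CELLS * FRACTION_TO_PCR
lane_cell_equivalents = pcr_cell_equivalents * FRACTION_LOADED

print(extract_volume_ul(EXTENSION_INPUT_CELLS, EXTRACT_CELLS_PER_UL))  # µl of extract
print(pcr_cell_equivalents, lane_cell_equivalents)
```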

Reconstitution in BJ cells was also carried out using other hTRT constructs
(i.e., pGRN145, pGRN155 and pGRN138). Reconstitution using these constructs appeared
to result in more telomerase activity than in the pGRN133 transfected cells.

The highest level of telomerase activity was achieved using pGRN155. As
discussed supra, pGRN155 is a vector containing the adenovirus major late promoter as a
controlling element for the expression of hTRT and was shown to reconstitute telomerase
activity when transfected into BJ cells.

Notably, when reconstitution using the hTRT-GFP fusion protein pGRN138
(which localizes to the nucleus, see Example 15, infra) was performed either in vitro (see
Example 7) or in vivo (transfection into BJ cells), telomerase activity resulted. By transfection
into BJ cells, for example, as described supra, telomerase activity was comparable to that
resulting from reconstitution in vitro using pGRN133 or pGRN145.

Similar results were obtained upon transfection of normal human retinal
pigmented epithelial (RPE) cells with the hTRT expression vectors of the invention. The
senescence of RPE cells is believed to contribute to or cause the disease of age-related 
macular degeneration. RPE cells treated in accordance with the methods of the invention 
using the hTRT expression vectors of the invention should exhibit delayed senescence, as 
compared to untreated cells, and so be useful in transplantation therapies to treat or prevent 
age-related macular degeneration. 

EXAMPLE 14 
PROMOTER REPORTER CONSTRUCT 

This example describes the construction of plasmids in which reporter genes 
are operably linked to hTRT upstream sequences containing promoter elements. The vectors 
have numerous uses, including identification of cis and trans transcriptional regulatory 
factors in vivo and for screening of agents capable of modulating (e.g., activating or 
inhibiting) hTRT expression (e.g., drug screening). Although a number of reporters may be 
used (e.g., firefly luciferase, β-glucuronidase, β-galactosidase, chloramphenicol acetyl
transferase, and GFP and the like), the human secreted alkaline phosphatase (SEAP;
Clontech) was used for initial experiments. The SEAP reporter gene encodes a truncated
form of the placental enzyme which lacks the membrane anchoring domain, thereby allowing
the protein to be secreted efficiently from transfected cells. Levels of SEAP activity detected
in the culture medium have been shown to be directly proportional to changes in intracellular
concentrations of SEAP mRNA and protein (Berger et al., 1988, Gene 66:1; Cullen et al.,
1992, Meth. Enzymol. 216:362). 

Four constructs (pGRN148, pGRN150, "pSEAP2 basic" (no promoter
sequences = negative control) and "pSEAP2 control" (contains the SV40 early promoter and
enhancer)) were transfected in triplicate into mortal and immortal cells.

Plasmid pGRN148 was constructed as illustrated in Figure 9. Briefly, a BglII-Eco47III
fragment from pGRN144 was excised by digestion and cloned into the BglII-NruI site of
pSEAP2 Basic (Clontech, San Diego, CA). A second reporter-promoter, plasmid pGRN150,
includes sequences from the hTRT intron described in Example 3, to employ regulatory
sequences that may be present in the intron. The initiating Met is mutated to Leu, so that the
second ATG following the promoter region will be the initiating ATG of the SEAP ORF.

The pGRN148 and pGRN150 constructs (which include the hTRT promoter)
were transfected into mortal (BJ cells) and immortal (293) cells. All transfections were done
in parallel with two control plasmids: one negative control plasmid (pSEAP basic) and one
positive control plasmid (pSEAP control which contains the SV40 early promoter and the
SV40 enhancer).

In immortal cells, pGRN148 and pGRN150 constructs appear to drive SEAP 
expression as efficiently as the pSEAP2 positive control (containing the SV40 early promoter 
and enhancer). In contrast, in mortal cells only the pSEAP2 control gave detectable activity. 
These results indicate that, as expected, hTRT promoter sequences are active in tumor cells 
but not in mortal cells. 

Similar results were obtained using another normal cell line (RPE, or retinal
pigmented epithelial cells). In RPE cells transfected with pGRN150 (containing 2.2 kb of
upstream genomic sequence), the hTRT promoter region was inactive while the pSEAP2 
control plasmid was active. 

As noted supra, plasmids in which reporter genes are operably linked to hTRT 
upstream sequences containing promoter elements are extremely useful for identification and 
screening of telomerase activity modulatory agents, using both transient and stable 
transfection techniques. In one approach, for example, stable transformants of pGRN148 are 
made in telomerase negative and telomerase positive cells by cotransfection with a eukaryotic 
selectable marker (such as neo) according to Ausubel et al., 1997, supra. The resulting cell 
lines are used for screening of putative telomerase modulatory agents, for example, by 
comparing hTRT-promoter-driven expression in the presence and absence of a test 
compound. 
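
The comparison of reporter expression in the presence and absence of a test compound can be expressed as a background-corrected fold change. The following Python sketch shows one way this might be computed; the replicate averaging, the two-fold calling threshold, and all readings are assumptions chosen for illustration, not values from the experiments described here.

```python
from statistics import mean


def background_corrected(readings, blank_readings):
    """Mean reporter signal minus mean signal from a no-promoter (blank) control."""
    return mean(readings) - mean(blank_readings)


def fold_change(treated, untreated, blank):
    """Ratio of corrected reporter signal with compound to signal without it."""
    t = background_corrected(treated, blank)
    u = background_corrected(untreated, blank)
    if u <= 0:
        raise ValueError("untreated signal is not above background")
    return t / u


def classify(fc, threshold=2.0):
    """Call a compound using an assumed symmetric fold-change threshold."""
    if fc >= threshold:
        return "candidate activator"
    if fc <= 1 / threshold:
        return "candidate inhibitor"
    return "no call"


# Hypothetical triplicate SEAP readings (arbitrary units).
fc = fold_change(treated=[30, 32, 34], untreated=[10, 12, 14], blank=[1, 2, 3])
print(fc, classify(fc))
```

Running the same comparison in telomerase negative and telomerase positive stable lines would help separate compounds acting on the hTRT promoter from compounds with general effects on reporter expression.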

The promoter-reporter (and other) vectors of the invention are also used to 
identify trans- and cis-acting transcriptional and translational regulatory elements. Examples 
of cis-acting transcriptional regulatory elements include promoters and enhancers of the 
telomerase gene. The identification and isolation of cis- and trans- acting regulatory agents 
provide for further methods and reagents for identifying agents that modulate transcription 
and translation of telomerase. 

To identify sequences or elements that play a role in hTRT expression,
expression was tested using promoter-reporter constructs with varying amounts of the
upstream region (5' to the transcription initiation site) of the hTRT gene. Experiments were
conducted using pGRN150 [which contains approximately 2405 bp of genomic sequence
upstream of the most 5' nucleotide present in the hTRT cDNA], pGRN176 [which contains
approximately 186 bp of genomic sequence upstream of the most 5' nucleotide present in the
hTRT cDNA] and pGRN175 [which contains approximately 77 bp of genomic sequence
upstream of the most 5' nucleotide present in the hTRT cDNA]. The following sequence is
present in pGRN176 but not pGRN175: 5'-GTGGCGGAGGGACTGGGGACCCGGGC
ACCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCC
CGTCCCGAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGG-3'.

When transfected into mortal cells (RPE and BJ), the pGRN175 promoter was
active, while the pGRN176 and pGRN150 promoters were not active. These results
demonstrate that the approximately 120 basepair region present in pGRN176 but not
pGRN175 includes sequences that play a role in the mortal-cell specific repression of hTRT
gene expression. It will be recognized that less than the entire approximately 120
basepair sequence may be required for this effect, and that other sequences not in the
approximately 120 base pair region may also play a role (independently or in combination
with the approximately 120 base pair region) in regulation of hTRT expression. Thus, the
approximately 120 base pair region includes all or part of one or more cis-acting elements.
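
The sequence given above as present in pGRN176 but not pGRN175 can be examined with a few lines of Python. The sketch below measures its length and GC content and tiles it into overlapping subsequences of the kind one might test individually when narrowing down a cis-acting element. The window and step sizes and the helper names are illustrative assumptions, and the sequence is transcribed from the listing as printed.

```python
# Sequence as printed in the text, one string per printed line.
REGION_LINES = [
    "GTGGCGGAGGGACTGGGGACCCGGGC",
    "ACCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCC",
    "CGTCCCGAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGG",
]
REGION = "".join(REGION_LINES)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a DNA sequence."""
    return sum(base in "GC" for base in seq) / len(seq)


def tiled_subsequences(seq: str, window: int, step: int):
    """Yield (start, subsequence) for overlapping windows; a partial last window is kept."""
    for start in range(0, len(seq), step):
        yield start, seq[start:start + window]
        if start + window >= len(seq):
            break


print(len(REGION), round(gc_fraction(REGION), 2))
for start, sub in tiled_subsequences(REGION, window=40, step=20):
    print(start, sub)
```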

Without intending to be bound by any particular mechanism, the
approximately 120 base pair sequence includes a binding site for a repressor (e.g., a trans
acting repressor) which upon binding prevents initiation of transcription of the hTRT gene.
Such a repressor may be the product of an anti-oncogene (e.g., a novel anti-oncogene), which
can be identified and cloned in accordance with the teachings herein and the use of the novel
reagents disclosed herein. In normal cells, repressor binding or interaction with hTRT
regulatory sequences (e.g., including or within the approximately 120 base pair sequence)
results in the absence of hTRT protein and therefore of telomerase activity. Activation of
telomerase in cancer cells can result from the loss of hTRT repressor activity.

A number of applications of the "approximately 120 base pair region"
described above will be immediately apparent upon review of this disclosure, including for
treatment or diagnosis of telomerase related diseases and identification of agents with
telomerase modulatory activity. For example, using standard techniques, the sequence may
be used to identify agents or proteins (e.g., naturally occurring repressor proteins) that
specifically bind to the approximately 120 base pair sequence or a subsequence thereof. In
addition, synthetic or naturally occurring agents that increase or stabilize repression (e.g., by
binding or otherwise interacting with the sequence, by stabilizing binding by a naturally
occurring repressor, or by other means) will be useful for reducing telomerase activity in a
cell (e.g., for treatment of malignancy). Similarly, agents that reduce repression (e.g., by
inhibiting repressor binding, or by other means) will be useful for increasing telomerase
expression (e.g., by controlled activation), for example to increase the proliferative capacity
of normal cells.

EXAMPLE 15
SUBCELLULAR LOCALIZATION OF hTRT 

A fusion protein having hTRT and enhanced green fluorescent protein (EGFP;
Cormack et al., 1996, Gene 173:33) regions was constructed as described below. The EGFP
moiety provides a detectable tag or signal so that the presence or location of the fusion
protein can be easily determined. Because EGFP-fusion proteins localize in the correct
cellular compartments, this construct may be used to determine the subcellular location of
hTRT protein.

A. Construction of pGRN138

A vector for expression of an hTRT-EGFP fusion protein in mammalian cells
was constructed by placing the EcoRI insert from pGRN124 (see Example 6) into the EcoRI
site of pEGFP-C2 (Clontech, San Diego, CA). The amino acid sequence of the fusion protein
is provided below. EGFP residues are in bold, residues encoded by the 5' untranslated region
of hTRT mRNA are underlined, and the hTRT protein sequence is in normal font.

MVSKGEELPTGWPILVELDGDVNGHKFSVSGEGEGDATYGKLTLKFICTTGKLPVPWPT 
LVTTLTYGVQCFSRYPDHMKQHDFFKSAMPEGYVQERTIFFKDDGNYKTRAEVKFEGDTL 
VNRIELKGIDFKEDGNILGHKIjEYNyWSHNVYIMADKQKNGIKVNFKIRHNIEDGSVQIjA 
DHYQQNTPIGDGPVLLPDNHYLSTQSALSKDPNEKRDHMVLLEFVTAAGITLGMDELYKS 

GRTO I S S S SFEFAAAS TORCVIjIiRTWEAIx&PATPAM PRAPRCRAVRS KLRSHYREVL PLA
TFVRRLGPQGWRLVQRGDPAAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVL 
QRLCERGAKNVLAFGFALLDGARGGP PEAFTTS VRS YLPNTVTDALRGS GAWGLLLRRVG 
DD VTjVHIjIjARCAL FVLVAPS CAyQVCGPPLYQLGAATQARP PPHASGPRRRLGCERAWNH 
SVREAGVPLGLPAPGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGP
SDRGFCWSPARPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVY
AETKHFLYSSGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQR
YWQMRPLFLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDP 
RRLVQLLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSL 
QELTWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTE 
TTFQKNRLFFYRPSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFIP
KPDGLRPIVNMDYWGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLGLDD 
IHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTI PQDRLTEVIAS I IKPQNTYCVRRYA 
WQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAWIEQSSSLNEASSGL 
FDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAGIRRDGLLLR 
LVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPAH
GLFPWCGLLLDTRTLEVQSDYSSYARTSIRASVTFNRGFKAGRNMRRKLFGVLRLKCHSL 
FLDLQVNSLQWCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPTFFLRVISDTASLCYS 
I LKAKNAGMS LGAKGAAGPL P S E AVQWLCHQ AFLLKLTRHRVTYVPLLGSLRTAQTQLSR 
KLPGTTLTALEAAANPALPSDFKTILD 


Other EGFP fusion constructs can be made using partial (e.g., truncated) hTRT coding 
sequence and used, as described infra, to identify activities of particular regions of the hTRT 
polypeptide. 



B. Nuclear Localization and Uses of pGRN138

Transfection of NIH 293 and BJ cells with pGRN138 confirmed the nuclear 
localization of recombinantly expressed hTRT. Cells were transfected with pGRN138 
(EGFP-hTRT) and with a control construct (expressing EGFP only). Nuclear localization of 
the EGFP-hTRT is apparent in both cell types by fluorescence microscopy. As noted supra,
the pGRN138 hTRT-GFP fusion protein supports reconstitution of telomerase activity in both
an in vitro transcription translation system and in vivo when transfected into BJ cells.

The hTRT-EGFP fusion proteins (or similar detectable fusion proteins) can be
used in a variety of applications. For example, the fusion construct described in this example,
or a construct of EGFP and a truncated form of hTRT, can be used to assess the ability of
hTRT and variants to enter a cell nucleus and/or localize at the chromosome ends. In
addition, cells stably or transiently transfected with pGRN138 are used for screening
compounds to identify telomerase modulatory drugs or compounds. Agents that interfere
with nuclear localization or telomere localization can be identified as telomerase inhibitors.
Tumor cell lines stably expressing EGFP-hTRT can be useful for this purpose. Potential
modulators of telomerase will be administered to these transfected cells and the localization
of the EGFP-hTRT will be assessed. In addition, FACS or other fluorescence-based methods
can be used to select cells expressing hTRT to provide homogeneous populations for drug
screening, particularly when transient transfection of cells is employed.

In other applications, regions of the hTRT can be mutagenized to identify
regions (e.g., residues 193-196 (PRRR) and 235-240 (PKRPRR)) required for nuclear
localization, which are targets for anti-telomerase drugs (telomerase activity modulators).
Other applications include:

use of the fusion protein as a fluorescent marker of efficient cell transfection
for both transient transfection experiments and when establishing stable cell lines expressing
EGFP-hTRT;

expression of an hTRT-EGFP fusion with mutated nuclear localization signals
(deficient for nuclear localization) in immortal cells so that the hTRT mutant-EGFP
scavenges all the hTR of the immortal cells, retaining it in the cytoplasm and preventing
telomere maintenance; and

use as a tagged protein for immunoprecipitation.

EXAMPLE 16 

EFFECT OF MUTATION ON TELOMERASE CATALYTIC ACTIVITY 

This example describes hTRT variant proteins having altered amino acids and
altered telomerase catalytic activity. Amino acid substitution followed by functional
analysis is a standard means of assessing the importance and function of a polypeptide
sequence. This example demonstrates that changes in the reverse transcriptase (RT) and
telomerase (T) motifs affect telomerase catalytic activity.

Conventional nomenclature is used to describe mutants: the target residue in
the native molecule (hTRT) is identified by one-letter code and position, and the
corresponding residue in the mutant protein is indicated by one-letter code. Thus, for
example, "K626A" specifies a mutant in which the lysine at position 626 (i.e., in motif 1) of
hTRT is changed to an alanine.
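
The nomenclature just described is regular enough to parse mechanically. The following Python sketch splits a label such as "K626A" into its parts and applies a substitution to a short peptide segment. The function names and the six-residue segment numbered from position 560 (written here with V at the variable "x" position of the FFYxTE motif) are illustrative assumptions; the sketch does not operate on the full hTRT sequence.

```python
import re
from typing import NamedTuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
_LABEL = re.compile(rf"^([{AMINO_ACIDS}])(\d+)([{AMINO_ACIDS}])$")


class Substitution(NamedTuple):
    native: str    # one-letter code of the residue in the native protein
    position: int  # residue number in the native protein
    mutant: str    # one-letter code of the replacement residue


def parse_substitution(label: str) -> Substitution:
    """Parse a label such as 'K626A' into (native, position, mutant)."""
    match = _LABEL.match(label)
    if match is None:
        raise ValueError(f"not a substitution label: {label!r}")
    native, position, mutant = match.groups()
    return Substitution(native, int(position), mutant)


def apply_substitution(segment: str, label: str, first_residue_number: int = 1) -> str:
    """Apply a substitution to a segment whose first residue has the given number."""
    sub = parse_substitution(label)
    index = sub.position - first_residue_number
    if not 0 <= index < len(segment):
        raise ValueError("position lies outside the segment")
    if segment[index] != sub.native:
        raise ValueError(f"expected {sub.native} at {sub.position}, found {segment[index]}")
    return segment[:index] + sub.mutant + segment[index + 1:]


print(parse_substitution("K626A"))
# Illustrative segment standing in for residues 560-565.
print(apply_substitution("FFYVTE", "F561A", first_residue_number=560))
```

Checking the native residue before substituting guards against off-by-one numbering errors, which are easy to make when a label is applied to a fragment rather than the full-length protein.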

A. Mutation of hTRT FFYxTE Motif

In initial experiments, a vector encoding an hTRT mutant protein, "F560A,"
was produced in which amino acid 560 of hTRT was changed from phenylalanine (F) to
alanine (A) by site directed mutagenesis of pGRN121 using standard techniques. This
mutation disrupts the TRT FFYxTE motif. The resulting F560A mutant polynucleotide was
shown to direct synthesis of a full length hTRT protein as assessed using a cell-free
reticulocyte lysate transcription/translation system in the presence of 35S-methionine.

When the mutant polypeptide was co-translated with hTR, as described in 
Example 7, no telomerase activity was detected as observed by TRAP using 20 cycles of 
PCR, while a control hTRT/hTR cotranslation did reconstitute activity. With 30 cycles of 
PCR in the TRAP assay, telomerase activity was observable with the mutant hTRT, but was 
considerably lower than the control (wild-type) hTRT. 

B. Additional Site-Directed Mutagenesis of hTRT Amino Acid Residues 

Conserved amino acids in six RT motifs were changed to alanine using 
standard site directed mutagenesis techniques (see, e.g., Ausubel, supra) to assess their 
contribution to catalytic activity. The mutants were assayed as IVR telomerase by the
two step conventional/TRAP assay detailed in Example 7.

The K626A (motif 1), R631A (motif 2), D712A (motif A), Y717A (motif A), 
D868A (motif C) mutants had greatly reduced or undetectable telomerase activity (<1% of 
wild-type), while the Q833A (motif B) and G932A (motif E) mutants exhibited 
low/intermediate levels of activity (<10% of wild-type). Two mutations outside the RT 
motifs, R688A and D897A, had activity equivalent to wild type hTRT. These results were 
consistent with analogous mutations in reverse transcriptases (Joyce et al., 1994, Ann. Rev. 
Biochem. 63:777) and are similar to results obtained with Est2p (see Lingner, 1997, Science 
276:561). The experiments identify residues in the RT motifs critical and not critical for 
enzymatic activity and demonstrate that hTRT is the catalytic protein of human telomerase. 
The mutations provide variant hTRT polypeptides that have utility, e.g., as dominant/negative 
regulators of telomerase activity. 

Amino acid alignment of the known TRTs identified a telomerase-specific
motif, motif T (see supra). To determine the catalytic role of this motif in hTRT, a six amino
acid deletion in this motif (Δ560-565; FFYxTE) was constructed using standard site directed
mutagenesis techniques (Ausubel, supra). The deletion was assayed as IVR telomerase
by the two step conventional/TRAP assay detailed in Example 7. The Δ560-565 mutant
had no observable telomerase activity after 25 cycles of PCR whereas wild type hTRT IVR
telomerase produced a strong signal. Each amino acid residue in motif T was
examined independently in a similar manner; mutants F560A, Y562A, T564A, and E565A
retained intermediate levels of telomerase activity, while a control mutant, F487A, had
minimal effect on activity. Notably, mutant F561A had greatly reduced or undetectable
telomerase activity, while activity was fully restored in its "revertant", F561A561F.
F561A561F changes the mutated position back to its original phenylalanine. This is a control
that demonstrates that no other amino acid changes occurred to the plasmid that could
account for the decreased activity observed. Thus, the T motif is the first non-RT motif
shown to be absolutely required for telomerase activity.

Motif T can be used for identification of TRTs from other organisms and
hTRT proteins comprising variants of this motif can be used as a dominant/negative regulator
of telomerase activity. Unlike most other RTs, telomerase stably associates with and
processively copies a small portion of a single RNA (i.e., hTR); thus motif T can be involved
in mediating hTR binding, the processivity of the reaction, or other functions unique to the
telomerase RT.

In other experiments, it was observed that the deletion variant encoded by
pro90hTRT described herein did not reconstitute telomerase activity when co-synthesized
with hTR, as measured using a modified TRAP assay (Autexier et al., 1996, EMBO Journal
15:5928, which is incorporated herein by reference).

EXAMPLE 17

SCREENING FOR TELOMERASE ACTIVITY MODULATORS USING 
RECOMBINANTLY EXPRESSED TELOMERASE COMPONENTS 

This example describes the use of in vitro reconstituted telomerase for
screening and identifying telomerase activity modulators. The assay described is easily
adapted to high-throughput methods (e.g., using multiple well plates and/or robotic systems).
Numerous variations on the steps of the assay will be apparent to one of skill in the art after
review of this disclosure.

Recombinant clones for telomerase components (e.g., hTRT and hTR) are
transcribed and translated (hTRT only) in an in vitro reaction as follows and as described in
Example 7 supra, using the TNT® T7 Coupled Reticulocyte lysate system (Promega), which
is described in U.S. Patent No. 5,324,637, following the manufacturer's instructions:

Reagent                               Amount per reaction (µl)
TNT Rabbit Reticulocyte lysate        25
TNT reaction buffer                   2
TNT T7 RNA Pol.                       1
AA mixture (complete)                 1
Prime RNase inhibitor                 1
Nuclease-free water                   16
XbaI cut pGRN121 [hTRT] (0.5 µg)      2
FspI cut pGRN164 [hTR] (0.5 µg)       2

The reaction is incubated at 30°C for 2 hours. The product is then purified on an
Ultrafree-MC DEAE filter (Millipore).

The recombinant telomerase product (IVRP) is assayed in the presence and
absence of multiple concentrations of test compounds which are solubilized in DMSO (e.g.,
10 µM - 100 µM). Test compounds are preincubated in a total volume of 25 µl for 30
minutes at room temperature in the presence of 2.5 µl IVRP, 2.5% DMSO, and 1X TRAP
Buffer (20 mM Tris-HCl, pH 8.3, 1.5 mM MgCl2, 63 mM KCl, 0.05% Tween 20, 1.0 mM
EGTA, 0.1 mg/ml Bovine serum albumin). Following the preincubation, 25 µl of the TRAP
assay reaction mixture is added to each sample. The TRAP assay reaction mixture is
composed of 1X TRAP buffer, 50 µl dNTP, 2.0 µg/ml primer ACX, 4 µg/ml primer U2, 0.8
attomol/ml TSU2, 2 units/50 µl Taq polymerase (Perkin Elmer), and 2 µg/ml
[32P] 5' end-labeled primer TS (3000 Ci/mmol). The reaction tubes are then placed in the PCR
thermocycler (MJ Research) and PCR is performed as follows: 60 min at 30°C, 20 cycles of
{30 sec at 94°C, 30 sec at 60°C, 30 sec at 72°C}, 1 min at 72°C, cool down to 10°C. The
TRAP assay is described, as noted supra, in U.S. Patent No. 5,629,154. The primers and
substrate used have the sequences: TS Primer (5'-AATCCGTCGAGCAGAGTT-3'); ACX
Primer (5'-GCGCGG[CTTACC]3CTAACC-3'); U2 primer
(5'-ATCGCTTCTCGGCCTTTT-3'); TSU2
(5'-AATCCGTCGAGCAGAGTTAAAAGGCCGAGAAGCGAT-3').
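
The relationship among the listed oligonucleotides can be checked with a short script. The sketch below expands the bracketed repeat notation used for the ACX primer and tests whether the TSU2 substrate equals the TS primer followed by the reverse complement of the U2 primer, which is the arrangement that would let TS and U2 amplify it. The helper names are hypothetical, and the TSU2 string is used as read from the listing above.

```python
import re

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(_COMPLEMENT)[::-1]


def expand_repeats(notation: str) -> str:
    """Expand bracketed repeats, e.g. 'AA[CT]3G' becomes 'AACTCTCTG'."""
    return re.sub(r"\[([ACGT]+)\](\d+)",
                  lambda m: m.group(1) * int(m.group(2)),
                  notation)


TS = "AATCCGTCGAGCAGAGTT"
U2 = "ATCGCTTCTCGGCCTTTT"
TSU2 = "AATCCGTCGAGCAGAGTTAAAAGGCCGAGAAGCGAT"
ACX = expand_repeats("GCGCGG[CTTACC]3CTAACC")

print(ACX, len(ACX))
print(TSU2 == TS + reverse_complement(U2))
```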

After completion of the PCR step, 4 µl of 10X loading buffer containing
bromophenol blue is added to each reaction tube and products (20 µl) are run on a 12.5%
non-denaturing PAGE in 0.5X TBE at 400 V. The completed gel is subsequently dried and
the TRAP products are visualized by Phosphorimager or by autoradiography. The telomerase
activity in the presence of the test compound is measured by comparing the incorporation of
label in reaction product to a parallel reaction lacking the agent.

The following clones described in the Examples have been deposited with the
American Type Culture Collection, Rockville, MD 20852, USA:
Lambda phage λ 25-1.1 ATCC accession number 209024
pGRN121 ATCC accession number 209016
Lambda phage λGΦ5 ATCC accession number 98505

The present invention provides novel methods and materials relating to hTRT
and diagnosis and treatment of telomerase-related diseases. While specific examples have
been provided, the above description is illustrative and not restrictive. Many variations of the
invention will become apparent to those of skill in the art upon review of this specification.
The scope of the invention should, therefore, be determined not with reference to the above
description, but instead should be determined with reference to the appended claims along
with their full scope of equivalents.

All publications and patent documents cited in this application are
incorporated by reference in their entirety for all purposes to the same extent as if each
individual publication or patent document were so individually denoted.

WHAT IS CLAIMED IS: 

1. An isolated, substantially pure, or recombinant protein preparation of 
a human telomerase reverse transcriptase (hTRT) protein, or a variant thereof, or a fragment 
thereof. 

2. An isolated, synthetic, substantially pure, or recombinant 
polynucleotide that is at least ten nucleotides to 3 kb in length and comprises a contiguous 
sequence of at least ten nucleotides that is identical or exactly complementary to a contiguous 
sequence encoding a recombinant protein of claim 1. 

3. The polynucleotide of claim 2 that encodes an hTRT protein or 
fragment. 

4. A method of identifying a compound that modulates hTRT activity, 
said method comprising the steps of contacting an hTRT protein of claim 1 with said 
compound and measuring a change in a property or activity of said hTRT, wherein a 
statistically significant change in said property or activity identifies said compound as a 
modulator of hTRT activity. 

5. The method of claim 4 wherein the compound is an inhibitor of hTRT 
activity. 

6. A method of preparing recombinant telomerase, said method 
comprising contacting a recombinant hTRT protein of claim 1 with a telomerase RNA 
component under conditions such that said recombinant protein and said telomerase RNA 
component associate to form a telomerase enzyme capable of catalyzing the addition of 
nucleotides to a telomerase substrate. 

7. The method of claim 6, wherein the hTRT protein has a sequence of 
Figure 17. 

8. The method of claim 7, wherein the hTRT protein is produced in an in 
vitro expression system. 

9. The method of claim 6, wherein said hTRT protein is substantially 
purified before said contacting. 

10. A method for increasing the proliferative capacity of a vertebrate cell 
by introducing a recombinant hTRT polynucleotide of claim 3 into the cell, and wherein said 
sequence is operably linked to a promoter. 

11. A method of detecting the presence of at least one telomerase positive 
human cell in a biological sample comprising human cells, said method comprising the steps: 

a) measuring the amount of an hTRT gene product in said 
sample, 

b) comparing the amount measured with a control correlating 
to a sample lacking telomerase positive cells, 

wherein the presence of a higher level of the hTRT gene product in said 
sample as compared to said control is correlated with the presence of telomerase positive cells 
in the biological sample. 

12. The method of claim 11, wherein said telomerase positive cells are 
cancer cells. 

13. The method of claim 11, wherein the amount of an hTRT gene 
product is measured using an antibody. 

14. The method of claim 11, wherein the amount of an hTRT gene 
product is measured using a nucleotide probe. 

15. The method of claim 11, wherein said detecting involves diagnosing a 
telomerase-related condition in a patient, and said method further comprises the steps of: 

a) obtaining a cell or tissue sample from the patient; 

b) measuring the amount of an hTRT gene product in the cell 
or tissue; and, 

c) comparing the amount of hTRT gene product in the cell or 
tissue with the amount in a healthy cell or tissue of the same type; 

wherein a different amount of hTRT gene product in the sample from 
the patient and the healthy cell or tissue is diagnostic of a telomerase-related condition. 

16. The method of claim 15 wherein the amount is higher in said sample 
than in said healthy cell or tissue and said telomerase-related condition is cancer. 

17. A method for treatment of a condition associated with an elevated 
level of telomerase activity within a cell, comprising introducing into said cell a 
therapeutically effective amount of an inhibitor of said telomerase activity, wherein said 
inhibitor is an hTRT polypeptide, an antibody that binds hTRT, or an hTRT polynucleotide. 

18. The method of claim 17, wherein the inhibitor is an oligonucleotide 
comprising the sequence of Figure 17 or a subsequence or variant thereof. 

19. The method of claim 18, wherein the oligonucleotide comprises 
nonstandard or derivatized bases or linkages between bases. 

20. The method of claim 17, wherein the inhibitor is a polynucleotide that 
inhibits binding of endogenous hTRT to hTR. 

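The sequence criterion recited in claim 2 (a polynucleotide of ten nucleotides to 3 kb containing a run of at least ten contiguous nucleotides identical or exactly complementary to a reference sequence) can be illustrated computationally. The Python sketch below is an illustration under stated assumptions only: the function names are hypothetical, "3 kb" is taken as 3000 nucleotides, "exactly complementary" is modeled as the reverse complement, and the reference fragment is the first line of the clone 712562 sequence of Figure 18. It is not a statement of how the claim would be construed.

```python
# Illustrative sketch; function names and the 3000-nt bound are assumptions.

_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# First line of the clone 712562 sequence as printed in Figure 18.
FIG18_LINE1 = "GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT"


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string containing only A/C/G/T."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def meets_claim2_sequence_test(probe: str, reference: str,
                               k: int = 10, max_len: int = 3000) -> bool:
    """True if probe is k..max_len nt long and shares a run of k contiguous
    nucleotides with the reference, on either strand."""
    probe, reference = probe.upper(), reference.upper()
    if not (k <= len(probe) <= max_len):
        return False
    # Every k-mer present on the reference strand as written.
    ref_kmers = {reference[i:i + k] for i in range(len(reference) - k + 1)}
    for i in range(len(probe) - k + 1):
        kmer = probe[i:i + k]
        # Identical run, or a run that base-pairs exactly with the reference.
        if kmer in ref_kmers or reverse_complement(kmer) in ref_kmers:
            return True
    return False


sense_probe = FIG18_LINE1[19:35]            # 16-nt fragment of the reference
antisense_probe = reverse_complement(sense_probe)

print(meets_claim2_sequence_test(sense_probe, FIG18_LINE1))      # True
print(meets_claim2_sequence_test(antisense_probe, FIG18_LINE1))  # True
print(meets_claim2_sequence_test("A" * 12, FIG18_LINE1))         # False
```

A set of reference k-mers keeps the check linear in the combined sequence lengths; a real probe-design workflow would also consider mismatches and hybridization conditions, which this sketch ignores.
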
ABSTRACT 

The invention provides compositions and methods related to human telomerase reverse 
transcriptase (hTRT), the catalytic protein subunit of human telomerase. The polynucleotides 
and polypeptides of the invention are useful for diagnosis, prognosis and treatment of human 
diseases, for changing the proliferative capacity of cells and organisms, and for identification and 
screening of compounds and treatments useful for treatment of diseases such as cancers. 
Seq. ID. No 7 



131 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CSl 
GGGRQTYYQC 
NFkB-MHC-1.2 
TGGGCTTCCCC 



2 41 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 

TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

- j 1 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 
GGGRQTYYQC 
NFkB_C32 
RGGGRMTYYCC 

Topo_I I_cleavage__s i t e 
RNYNNCLJNGYNGKTNYNY 

3 51 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 

TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 



1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTCGAAATAT 

5 i AACCTCAGTA TTAATAAGCT CaGATTTTAA ATATTAATTA CAAAACCTAA 

! 0 1 ATGGAGGTTG ATGT7GATAA TCA AGCTGAT AATCATGGCA TTCACTCAGC 

: 5 I TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTC TACTCTTGGA 

20 1 TCCAGAAAGT TATTAGATGA AGAAATCAAT C7CAAAGTCA TTATAAAGAT 

251 TTAGAAGATA TTAAAATATT TGCGC AG AC A AATATTGTTG CTACTCCACG 

301 AGACTATAAT GAAGAAGATT TTAA AGTTAT TGCAAGAAAA GAAGTATTTT 

3 5 i CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
40 1 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT 7TGGATT7CA 

4 5 I ACTTAAGGGA A ATC AATTAG C A A AG ACCCA TTTATTAACA GCTCTTTCAA 
50 I CTCAAAAGCA GTATTTCTTT C.AAGACGA AT GGAACCAAGT TAG AGCAATG 

5 5 i ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT taatattcc a 
60 i GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
65 i ATCATTTGAA AGTCAACGAT AAGTTTGACA aaaagcaaaa aggtggagca 
70 I GCAGACATGA ATGAACCTCG atgttgatca ACCTGCAAAT ACAATG7CAA 
'51 G A- A TG A G AAA . G A T C A C TTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
301 A.TATGAAATC AAGAACCAGA aTaTTTTATT CCACTCATTTTAATAGAAAT 
ZSl AACCAATTCTTCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
^0 1 AGCGATGGAC AGAGCTCAG A C G A T.\ TTC AC G AATATATTC AGATTTAATA 
°i i GAATTAGAAA GAAGCTAAAA CATAaGGTTA TCGAAAAAAT TGCCTACATG 
i 00 1 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA C AAAATCTTG 

! 05 I TCCTCTTCCA GAAAATTGGC GGG AACCGAA ACAAAAAATC CAAAACTTGA 

! 101 TAAATAAAAC TAGAGAAGAA AAGTCCAAGT ACTATGAAGA GCTGTTTAGC 

i 1 5 1 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 

1 20 i C AA TATA CTC CCCA AAG ACT TTTTG ACTGG A AG AAACCGT AAGAATTTTC 

1251 AAAAGAAAGT "AAG AAA TAT GTGGAACTAA ACAAGCATGA ACTCATTCaC 

; 30 1 AAAAACTTAT TGCTTCAGAA GATCAATACA AGAGAAATAT CATGCATGCA 

! 3 5 1 GGTTGAGACC TCTGCAAAGC A i I 1 I i ATT A TTTTG ATCAC GAAAACATCT 

. ^O I ACGTCTTATG GaaaTTGCTC CGATGGATAT TCGAGGATCT CGTCCTCTCG 

451 CTGATTAGAT GATTTTTCTA TGTCaCCCAG CAACAGAAAA GTTACTCCAA 

. 50 1 aaCCTATTaC TACAGAAAGA aTaTTTGGGA CGTCATTATG AAAATGTCAA 

'551 TCGCAGACTT AAAGAAGG A A ACGCTTGCTG AGGTCCAAGA AAAAG AGGTT 

: 60 1 GAAGAATGGA AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 

; 65 i ACCGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 

: 70 1 TAAATTCAGA CCCGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 

1751 AACTCTCACT TAATGCTTA A GaCATTGAAG AATAGAATGT TTAAAGATCC 

1 30 1 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATCAAA AAGTATGaGG 
1351 AGTTTGTTTG CAAATCGAAG CAAGTTGGAC AACCAAAACTCTTCTTTGCA 

1 90 1 ACTATGGATA TCGAAAAGTG ATATGaTAGT GTAAACAGAG AAAAACTATC 

1 95 i AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 

200 1 CTGCACAAAT TCTAA AGAGA A AG A ATA AC A TAGTTATCCA TTCGAAAAAC 

205 1 TTTAGAAAGA AAGAAATGAA AGATTaTTTT AGACAGAAAT TCCAGAAGAT 

210 \ TGCACTTGAA GGAGGACAAT ATCC AACCTT ATTCAGTGTT CTTG A AAA TG 

2 [ 5 i aaCAAAATGA CTTaaaTGCa aaG aaaaCaT TaaTTGTTGa aGCaaaGCAa 
220 1 aGAAATTATT TTAAGAAAG A Ta aCTTaCTT CAACCAGTCA TTAATATTTG 
225 ! CCAATATAATTACATTAACT TTaaTGGGAA GTTTTaTAAA CAAACAAAAG 
2301 CAATTCCTCA AGGTCTTTGa GTTTCATCAA TTTTGTCATC ATTTTA7TAT 
235 I GCAACATTAG A GG AAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 
240 ! CCCT GAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC 
24 5 1 TTTTGATTAC AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT 
250! ATAAAC GTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 
255 ! GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 
260 1 GTG i i G AGO A GCAA AATATT GTTCAAGATT ACTGCGATTG G ATTGGCATC 
265! TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 
270 1 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT 
2751 CAATG iGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 
220! CATTATTTTA GAAAGACGAT TACAACCGAA GACTTTGCGA ATAAAACTCT 
235! CAACAAGTTA TTTATATCAG GCGGTTACAA ATACATGCAA TGAGCCAAAG 
290! AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 
2?5 ' TTA £ A f G J AT CTAAAATTAT A TA CTCTGTA ACCAGAGCATTCTTTAAATA 
300 ! CC tiGiGTGC AATATTAAGG A TACAATT7T TGGAGAGGAG CATTaTCCAG 
305 i ACTTTTTCCT TaGCaCACTG AAGCACTTTA TTGAAATATT CaGCaCA-AAA 
310! .AAGTACA7TT TCAACAGAGT TTGCATGATC CTCAAGGCA-A A.AG AA GC AAA 
3 15! GCTAAAAAG i GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
320! C i ATT C7AA C TTa TTTTG GA AAGTTAATTT TC AA TTTTTG TCTTATATAC 
325! iGGGGi i i iG GGG7TT7GGG GT7TTGGGG 
! MEVDVDNQAD NHGIKSALK.T CEEIKEAKTL YSWIQKVIRC RNQSQSKYKD 
5 ! LEDDOFAQT NIVATPRDYN EEDFSCVIARK EVFSTGLMIE LIDKCLVELL 
• 0 1 SSSDVSDRQK LQCFGFQLKG NOLAiCTHLLT ALSTQKQYFF QDEWNQVRAM 
: 5 1 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 
201 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTKFNRN 
15 1 NQFFKKHEFV SNKNNISAMD RAQTIFTNIF RFNRIRKKLK DKVIEKJAYM 
301 LSKVKDFNFN YYLT7CSCPLP ENWRERXQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FFNEFFYNIL PKLDFLTGRNR KNFQKKVKKY VELNKHELIH 
-01 KNLLLEKINT REISWMQVET SAfCHFYYFDH ENIYVLWKLL RWTFEDLWS 
-s5 1 LERCF7YVTE QQKSYSKTYY YRKNIWDVIM KMSIADLKKE TLAEVQEKEV 
501 EEWKXSLGFA PGrCLRLIPKX TTFRPIMTFN KKIVNSDRKT TKLTTNTCLL 
5 5 1 NSHLMLKTLK NRMFKDPFGF A VFNYDDVMK KYEEFVCKWK QVGQPKLFFA 
501 TMDEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNTVtDSKN 
55 i FRKKEMKDYF RQKPQKJALE GGQ YPTLFSV LENEQNDLNA KJCTLrVEAiCQ 
'■j 1 RNYFKKDNLL QPVTNICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
"I ATLEESSLGF LRDESMNPEM PMVNLLMRLT DDYLLITTQE NNAVLFIEKL 
50 i [NVSRENGFK FNMKKLGTSF PLSPSKFAKY GMDSVEEQNI VQDYCDV/1GI 
55 1 S IDMKTLALM PNINLRIEGi LCTLNLNMQT KKASMWLKKX. LKSFLMNMTT 
?0l KYFRKTTTTE DrANKTLNKL FISGGYKYMQ CAKEYKDHFK KNLAMSSMID 
= 51 LEVSKIIYSV TRAFFKYLVC NIKDTIFGES HYPDFFLSTL KHFIEIFSTX 
CO! rCYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 


1 ggzaccgaccr2ccr::cc::c:tc2caagcc2aL:gccrccnc?aacccccc:2aaccici?gaaatact::ucaaca 80 

3 1 accczacaacaacaccaagccaaantccaacacgaaggzgcracnagncaccgacaacaLLticraccccaLccgccgcta 160 
161 ccsagnacaaggacaaaaagaacaaccrcccrccccccaaagacccttaciiraucaacttaccitrcaaacacacttcg 2 40 

2 41 ggcticgccraccrrraaccqcggcaccgcr^ragccgc^actiticragccaaccgcgcgticrctaccccgccatrggatac 320 

3 21 agcccrcggagcagcccacagaaaccccracaaacccccrgacgagaccacaccagacrcaccacagcccgcgcacatcc 400 
401 --aacacggagcczcacacccragacgagccacgccgcatigacggagcarcrggraccacccaacgcrrgccttigaaaag 480 

4 81 cczgacaacnanccgcaaaaccatgccczragrggcggcaacccgcgaaagcczcccgacgcccgcacacgcczagcacg 560 

5 61 acigagacacccaaaaattLccacccaccacaaccccccraacgcggccrraccccccractttccactcccangccgcc 640 
ccaaanacgtaccaccccgtaccaggcccczticcgc^tactcccggaaccgcacccccttcaccactccccctaatga 720 
acaacccaaactagnttcgcttacaactgacagcagcagaaagactggcgatcctacccgtigcaacgctactagtccaaa 800 



641 
721 



3 01 cacaccttgcaaaacacctaccagccancactacacaaaaaaaatccrataaccacaaacaccaaccaacacccgcggcc 880 
881 accacccatctaaaacgccacgaccagcaggacacctrgcacacacatagccacgciraacggtcacttgcaacttgc 958 



959 ATG ACC GAA CAC CAT ACC CCC AAA AGC AGG ATT CTT CGC 

I M 7 Z H H 7 ? K S R I L R 



CTA GAG AAT CAA TAT GTA LOIS 
u E M Q Y V 20 



1019 TAC CTA TGT ACC 

31 r L C T 

10 7? TAT AGC AAT ATA 
41 T S M I 

112 9 CAT TCG ACT GTA 
" 31 K 3 T V 



AAT GAT TAT GTA CAA CTT GTT TTG AGA GGG TCC CCG GCA AGC TCG 

k V 0 r V Q L V L R G S ? A S S 



CAA CGC TTC AGA AGC GAT GTA CAA ACC TCC TTT 

zLRLRSDVQTS F 

GGC TTC GAC AGT AAG CCA GAT GAA GGT GTT CAA 

GFQSX ? O Z G V Q 



ATT TTT CTT 



CCA 
1078 
40 

1138 
L98 
1199 AAA 7GC 7CA CAG 7CA GAG gcataw:attm7,mr;act::r:: = :ac:c?ggaca(?c:aatata:c5gcag 1272 
31 K 2 S } s £ ' 86 

127 3 CTA ATA GCG AA7 GTT GTA AAA CAG ATG TTC GAT GAA AGT TTT GAG CGT CGA AGG AAT CTA 1332 
37 L I A M 7 V ;< Q M F D Z S ~ Z R R ~ U L 106 

1333 CTG ATG AAA GGG TTT TCC ATG ccaacgcaccccaaLcgcgaaacacc:acccgcaactaczg;::caaacaga L405 
T07 1 M X G F S M " 1X3 

1406 c;?cac::aaccgacaaag AAT CAT GAA GAT TTT CGA GCC ATG' CAT GTA AAC GGA GTA CAA AAT 1469 
114 MHSDrRAMKV-MGVQN 128 

1470 GAT CTC GTT TCT ACT TTT CCT AAT TAC CTT ATA TCT ATA CTT CAG TCA AAA AAT TGG CAA 1529 
129 D 1 V S T F ? N Y L I S I 1 Z 5 K N W Q 148 

1530 CTT TTG TTA GAA AT gtaaacacccgr raaga tg:: :?c?cacc::?aacaagaccgacaagcacag T ATC GGC 1601 

14 9 1 1 1 £ I r G LS5 

15 02 AGT GAT GCC ATC CAT TAC TTA TTA TCC AAA GGA AGT ATT TTT GAG GCT CTT CCA AAT GAC 16 61 

. !a .iS6 sdam •< r 11s :<gs r r e a l ? m □ 17s 

M = 52 AAT TAC CTT CAG ATT TCT GGC ATA CCA CTT TTT AAA AAT AAT CTC TTT GAG GAA ACT CTG 1721 

^176 N' Y L Q I S G I ? 1 r i< M N V ~ E Z 7 '/ 195 

H722 TCA AAA AAA AGA AAG CGA ACC ATT GAA ACA TCC ATT ACT CAA AAT AAA AGC GCC CGC AAA 1781 

I fl 1 9^ s K ;< r :< r t 1 z t s 1 t Q n :< s a ?. :< 215 

p782 GAA GTT TCC TGG AAT AGC ATT TCA ATT AGT AGG TTT AGC ATT TTT TAC AGG TCA TCC TAT 1841 

, 216 e v s w n s rs r s a f s ityrssy 235 
= . 3 4 2 AAG AAG TTT AAG CAA G gcaac:aacac:gc:a:cc:icacaaccaacc::ag AT CTA TAT TTT AAC 

rn 907 

: : :~]23 6 K K F :< Q D 1 Y F N 245 

J;f 90S TTA CAC TCT ATT TCT GAT CGG AAC ACA GTA CAC ATG TGG CTT CAA TGG ATT TTT CCA AGG 1967 

tJ2 4 6 1 H S I C D a N 7 V H M W L Q W : F ? R 265 

19 68 CAA TTT GGA CTT ATA AAC CCA 777 CAA G7G AAG CAA TTG CAC AAA CTC ATT CCA CTG GTA 2027 

"-56 Q FG1INAFQVKQ 1HK7IPLV 285 

2028 TCA CAG AGT ACA CTT CTG CCC AAA CGT CTC CTA AAG GTA TAC CCT TTA ATT GAA CAA ACA 2087 

236 S Q S T V V ? K R L L K V Y ? L I £ Q T 305 

2088 GCA AAG CGA CTC CAT CGT ATT TCT CTA TCA AAA GTT TAC AAC CAT TAT TGC CCA TAT ATT 2147 

306 AXRLHRISLSKVYMHYC?rr 325 

2148 GAC ACC CAC GAT GAT GAA AAA ATC CTT AGT TAT TCC TTA AAG CCC AAC CAG GTG TTT GCG 2207 

326 D 7 H D D £ K [ L S Y S 1 K ? M Q V F A 345 

2208 777 CTT CGA TCC ATT CTT CTT CGA GTG TTT CCT AAA TTA ATC TGG GGT AAC CAA AGG ATA 22 57 

346 F L a S Z 1 7 R 7 F ? K 1 Z w G M Q R 1 365 

■263 777 CAG A7A A7A 77A AAA G a:atcr:acaaaac::a::accac:aacca::"accag AC C7C GAA ACT 23 3 6 

3 66 F Z I : 1 .< D 1 £ 7 375 
22 3 7 TTC TTC AAA TTA TCC AGA TAG GAG TCT TTT AGT TTA CAT TAT TTA ATG AGT AAC ATA AAG 23 95 
375 F - :< L S 3 Y £ S r S f - H Y - M 3 N Z X 395 

2 3 97 ccaacacgccaaacctrcruccscraaccaacaatcag ATT TCA GAA ATT GAA TGG CTA GTC CTT GGA 2 4 65 
•95 I S Z I Z W L V t G 40S 

2466 AAA AGG TCA AAT GCG AAA ATG TGC TTA AGT GAT TTT GAG AAA CGC AAG CAA ATA TTT GCG 2525 
* 406 X ?. S M A X M C L S D F £ X R X Q r F A 425 

2525 GAA TTC ATC TAC TGG CTA TAC AAT TCG TTT ATA ATA CCT ATT TTA CAA TCT TTT TTT TAT 2SS5 
426 SrlYWLYNSFrrPZLQSFFY 445 

2S3S ATC ACT GAA TCA AGT GAT TTA CGA AAT CGA ACT GTT TAT TTT AGA AAA GAT ATT TGG AAA 2 6 45 
■J46:T£SS0LRNR TVYrRKDlWR 465 

2 546 CTC TTC TGC CGA CCC TTT ATT ACA TCA ATG AAA ATG GAA GCG TTT GAA AAA -ATA AAC GAG 2705 

'! 6 6 u - C R ? F I T S M X M Z A F Z X I N £ 485 

2706 g^a::itaaac?cacc:ctcqcaaaaaqccaacac:::cag AAC AAT GTT AGG ATG GAT ACT CAG AAA ACT 2775 

486 N N V R X D T Q X T 495 

277 6 ACT TTG CCT CCA GCA GTT ATT CGT CTA TTA CCT AAG AAG AAT ACC TTT CCT CTC ATT ACG 233 5 

496 TL??AVIRLL?XXNTFRLtT 515, 

283 6 AAT TTA AGA AAA AGA TTC TTA ATA AAG gcactaaccc::ggccaccaacgcacct^c::z:aacctacta 2906 

516 M 1 R X R F L I X 524 

2 907 rragcag ATG GGT TCA AAC AAA AAA ATG TTA GTC AGT ACG AAC CAA ACT TTA CGA CCT GTC 2967 

525 MGS MXXMLVSTMQTLRPV 542 

2 9*6 3 GCA TCG ATA CTC AAA CAT TTA ATC AAT GAA GAA AGT AGT GGT ATT CCA TTT AAC TTG GAG 3 027 
^43 ASILKKLINclESSGIPFNLE 552 

302a GTT TAC ATG AAG CTT CTT ACT TTT AAG AAG GAT CTT CTT AAG CAC CGA ATG TTT GG gcaac 3088 

563 V Y h X L L T F X X O L L X H R M F G 581 

3089 :acacaatgcgcgactcctcattaccaactczgcag G CGT AAG AAG TAT TTT CTA CGG ATA GAT ATA 3155 

382 R X X Y F V R : o t 591 

3155 AAA TCC TCT TAT GAT CCA ATA AAG CAA GAT TTG ATG TTT CGG ATT GTT AAA AAG AAA CTC 3215 

592 X S C Y O R I X Q 0 L M F R r V X X X " L 611 

3 216 AAG GAT CCC GAA TTT GTA ATT CGA AAG TAT GCA ACC ATA CAT GCA ACA AGT GAC CGA GCT 3 275 
612 X 0 ? £ F V I R XY AT I HAT'SO R A 631 

3 27 6 ACA AAA AAC TTT GTT AGT GAG GCG TTT TCC TAT T gcaagctnatctntccactggaaccccicaacaa 3 3 43 

632T X N F V S £ A F S Y F 64 3 

3 344 ac:c::::trag TT CAT ATG GTG CCT TTT GAA AAA GTC GTG CAG TTA CTT TCT ATG AAA ACA 3 405 

5<U 0 « V ? F£XVVQLLSMXT 659 

3 406 TCA GAT ACT TTG TTT GTT GAT TTT GTG GAT TAT TGG ACC AAA AGT TCT TCT CAA ATT TTT 3 4 65 

5 50 Z D T - F V □ F V D Y W T .•: S 5 S Z I r 379 

3 4 66 AAA ATG CTC AAG GAA CAT CTC TCT GGA CAC ATT GTT AAG gcataccaaccgc:;aaccgcaacaaca 3 5 32 

53 0 :-; - 1 X Z H l s G H : V X 5 92 
3523 ccaacczaaciag ATA GGA AAT TCT CAA TAC CTT CAA AAA GTT GGT ATC CCT CAG GCC TCA 3593 
69 3 I G >J S 3 Y „ L Q '< V G I ? Q C S 708 

3 594 ATT CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG GAA GAT TTG ATT GAT GAA TAC CTA TCG 3 653 

7091 1 S S r L C rt F V M Z 0 L I D Z Y U S 728 

3 55 4 TTT ACG AAA AAG AAA GGA TCA GTG TTG TTA CGA GTA GTC GAC GAT TTC CTC TTT ATA ACA 3713 
. 729 FTKKKGSVLLRVVDOrLFlT 748 

3 714 GTT AAT AAA AAG GAT GCA AAA AAA TTT TTG AAT TTA TCT TTA AGA G gtgagctrccgccantCC 3777 
7 4 9 v y :< K 0 A :< :< F l n l s L R G 764 

3773 taagccrcaaccgc^gaag GA TTT GAG AAA CAC AAT TTT TCT ACG AGC CTC GAG AAA ACA GTA 3340 
755 FEXHNrSTS LEKTV 778 

3 34 1 ATA AAC TTT GAA AAT AGT AAT GGG ATA ATA AAC AAT ACT TTT TTT AAT GAA AGC AAG AAA 3 900 
7 79lMr£NSMGI iNtfTFrNSS.KK 798 

3 901 AGA ATG CCA TTC TTC GGT TTC TCT GTC AAC ATG AGG TCT CTT GAT ACA TTG TTA CCA TGT 3 960 
799 R M ?~~G~S { SHHR S L D T L L A C 818 

3 95 L - - T AAA ATT GAT GAA GCC TTA TTT AAC TCT ACA TCT GTA GAG CTG ACG AAA CAT ATG GGG 4 020 
319 ? >: : D £ A L F X S T S V £ u T < H M G 838 

4021 .AAA TCT TTT TTT TAC AAA' ATT CTA AG gcacacrgcgcaactgaacaacagccgacaaacaaccag A TCG 4089 
- 339 K 3 F F Y K I L R S 848 

4090 AGC CTT GCA TCC TTT GCA CAA. GTA TTT ATT GAC ATT ACC CAC AAT TCA AAA TTC AAT TCT 4149 

349 SLASFAQVF I D ZTHXSKFNS 868 

4 ISjG TCC TGC AAT ATA TAT AGG CTA GGA TAC TCT ATG TGT ATG AGA GCA CAA GCA TAC TTA AAA 4 209 
3 69 C C N I Y R L G Y S rt C M R A Q A Y L K 388 

4 210 AGG ATG AAG GAT ATA TTT ATT CCC CAA AGA ATG TTC ATA ACG G gcgagcactcac^- -aaccaga 4274 

389 a m k a : f r ? o a m f r t □ 903 

4 27 5 aaagccacraaccaaccccag AT CTT TTG AAT GTT ATT GGA AGA AAA ATT TCG AAA AAG TTG GCC 4 3 39 

90 4 L L N V I G R K I W K K L A 917 

4 340 GAA ATA TTA X3GA TAT ACG AGT AGG CGT TTC TTG TCC TCT GCA GAA GTC AAA TG gcacgcgcc 4401 
918 ErUGYTSRRFLSSAEVKW 935 

4 40 2 ggcczcgagaccrcagcaacactgacacaccag G CTT TTT TGT CTT GGA ATG AGA GAT GGT TTG AAA 4468 
936 LFCLGrtROCLK 946 

4469 CCC TCT TTC AAA TAT CAT CCA TGC TTC GAA CAG CTA ATA TAC CAA TTT CAG TCA TTG ACT 4 528 
947 ?SrKYHPCr£QLr YQFQHLT 966 

4 529 GAT CTT ATC AAG CCG CTA AGA CCA GTT TTC CCA CAG GTG TTA TTT TTA CAT AGA AGA ATA 4 588 
5 67 D L I K ? L R ? V L R Q */ L. F L H ?. R I 986 

4589 GCT GAT TAA cgccacc^-caacccaccacacacacccccraccacuggcg-ccraaacaacactaccaccaagcaca 4665 
987 A 3 • 989 
4 66 6 qcn^accccc:aaagcaagcacaccacaggaccr = tagc3iaagt:aaaaccaaccr.cgccacc^gccrr?accgaccz?ccc 4 74 5 
474 6 dzaccczcatactrtraagaaagaccgacagcggccgccgaccaccgcccacacgcccactiaaacgggagcggccaaaca 4 82S 
-4 82 6 c-aaaagcaacacacgaggccaacc^ccccrcactcagaacaaggaaagcggcccrccacaacgaacaacgcccgcacca 4905 

4 90 6 acgcaaaaagacgaagactaeccrcraaacaagggggaciiaagcacacccgaaggaaaagagagcaacatacccagcgct. 4 9 8 5 
4986 gccgaagaaagcaaggacaaLttggaacaagcczcrgcagacgacaggccaaaccrcggcgaccgaaccrrggcaaaagc 5065 
50 66 cccaggcracccacggcggccggccccgccaczgagacgaaaagaaaccaaggacagcctgaacactaacagczcactca SI 4 5 
514 6 acgtcccacacaaggccctgctrrtiticccgacczcaacrrrgcacgggcgaaaagaaacagcgccaagccatcaccggac 5225 

5 22 6 cccgaaacagccaaacccccrggctccrcaaagcggaagccnaaagaacc^accgaagccracgaggccrcaaaaacccc 53 05 
5306 cczzgacncaaaggaggaacccrccaccgaccaggaaacggacagcccaccagcrcccgaggagaagcccaaccrrcrgc 53 35 
53S6 aaaaaagaaaacaccancgggagacacccccrgacgaaccagacgcggagagcacccccagcggaccctrgacgccaaca 5465 
5 4 6 6 accccracrrccgaaacgcacggccccaccgccrccczgaccccccgcagccrcacgcagccaagcgaccaaaggcacc 5 544 
1 gcagcgctgc gtcctgctgc gcacgtggga agccctggcc ccggccaccc ccgcgatgcc 
61 gcgcgctccc cgctgccgag ccgtgcgctc cctgctgcgc agccactacc gcgaggtgct 
121 gccgctggcc acgttcgtgc ggcgcctggg gccccagggc tggcggctgg tgcagcgcgg 
181 ggacccggcg gctttccgcg cgctggtggc ccagtgcctg gtgtgcgtgc cctgggacgc 
241 acggccgccc cccgccgccc cctccttccg ccaggtgtcc tgcctgaagg agctggtggc 
3 01 ccgagtgctg cagaggctgt gcgagcgcgg cgcgaagaac gtgctggcct tcggcttcgc 

3 61 gctgctggac ggggcccgcg ggggcccccc cgaggccttc accaccagcg tgcgcagcta 
421 cctgcccaac acggtgaccg acgcactgcg ggggagcggg gcgtgggggc tgctgctgcg 

4 81 ccgcgtgggc gacgacgtgc tggttcacct gctggcacgc tgcgcgctct ttgtgctggt 
541 ggctcccagc tgcgcctacc aggtgtgcgg gccgccgctg taccagctcg gcgctgccac 
601 tcaggcccgg cccccgccac acgctagcgg accccgaagg cgtrctgggat gcgaacgggc 
661 ctggaaccat agcgtcaggg aggccggggc ccccccgggc ctgccagccc cgggtgcgag 
721 gaggcgcggg ggcagtgcca gccgaagtct gccgttgccc aagaggccca ggcgnggcgc 
781 tgcccctgag ccggagcgga cgcccgttgg gcaggggtcc tgggcccacc cgggcaggac 
8 41 gcgtggaccg agtgaccgrg gtttctgtgt ggtgccacct gccagacccg ccgaagaagc 
901 cacctctttg gagggtgcgc tctctggcac gcgccactcc cacccatccg tgggccgcca 
961 gcaccacgcg ggccccccan ccacatcgcg gccaccacgt cccngggaca cgccttgtcc 

1021 cccggtgtac gccgagacca agcacctccc ctactcctca ggcgacaagg agcagctgcg 
1081 gccctccttc ctactcagcn ctctgaggcc cagcctgacc ggcgctcgga ggctcgtgga 
1141 gaccatcctt: ctgggcccca ggccctggat gccagggact ccccgcaggc tgccccgcct 
1201 gccccagcgc tactggcaaa tgcggccccr gtttctggag cngcttggga accacgcgca 
1261 gtgccccrac ggggtgctcc ccaagacgca ctgcccgctg cgagctgcgg tcaccccagc 
1321 agccggtgtc tgtgcccggg agaagcccca gggctctgtg gcggcccccg aggaggagga 
13 81 cacagacccc cgtcgcctgg tgcagctgct ccgccagcac agcagccccc ggcaggtgta 
1441 cggcttcgtg cgggccrgcc tgcgccggct ggtgccccca ggcctzctggg gctccaggca 
1501 caacgaacgc cgcttcctca ggaacaccaa gaagttcatc tccctgggga agcatgccaa 
1561 gctctcgctg caggagctga cgrggaagac gagcgngcgg gactgcgctt ggctgcgcag 
1621 gagcccaggg gttggccgtg ttccggccgc agagcaccgt ctgcgtgagg agate ctggc 
1681 caagttcctg cactggctga tgagtgtgta egtegtcgag ctgctcaggt: ctttctttta 
1741 tgtcaeggag accacgcttc aaaagaacag gctctttttc taceggaaga gtgtctggag 
1801 caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggagcngtc 
1861 ggaagcagag gtcaggcagc ategggaage caggcccgcc ctgccgacgt ccagactccg 
1921 cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 
1981 cagaaegtte cgcagagaaa agagggcega gcgtctcacc tegagggega aggcactgtt 
2041 cagcgtgctc aactacgagc gggcgcggcg ccccggcctc ctgggcgcct ctgtgctggg 
2101 cctggacgat atccacaggg cctggcgcac cttcgtgctg cgtgtgcggg cccaggaccc 
2161 gccgcctgag ctgtactttg tcaaggtgga tgtgacgggc gegtacgaca ccatccccca 
2221 ggacaggctc aeggaggtea tcgccagcat catcaaaccc cagaacacgt actgcgtgcg 
2281 teggtatgee gtggtccaga aggccgccca tgggcacgtc cgcaaggcct teaagageca 
2341 cgtctctacc ttgacagacc tccagccgta catgegacag ttcgtggctc acctgeagga 
2401 gaccagcccg ctgagggatg ccgtcgtzcat cgagcagagc tcctccctga atgaggccag 
2461 cagcggcctc ttcgaegtet: tcctacgctt catgtgccac cacgccgtgc gcatcagggg 
2521 caagtcctac gtccagtgcc aggggatccc gcagggctcc atcctctcca cgctgctctg 
2581 cagcctgtgc taeggegaca tggagaacaa gctgtttgcg gggattegge gggaegggee 
2641 gctcctgcgt ttggtggang atttcttgtrt ggtgacacct cacctcaccc aegegaaaac 
2701 cttcctcagg accctggtcc gaggtgtccc tgagnatggc tgcgtggtga acttgeggaa 
2761 gacagtggtg aacttccctg tagaagacga ggcccrgggt: ggcaeggett ttgttcagat 
2821 gccggcccac ggcctattcc cctggcgcgg cctgcrgctg gataccegga ccctggaggt 
2881 geagagegae tactccagct atgcccggac ctccatcaga gccagtctca ccttcaaccg 






2941 cggcttcaag gctgggagga 
3001 tcacagcctg tttctggatt 
3061 caagatcctc ctgctgcagg 
3121 tcagcaagtt tggaagaacc 
3181 ctgctactcc atcctgaaag 
3241 cggccctctg ccctccgagg 
3 301 gactcgacac cgtgtcacct 
3361 gctgagtcgg aagctcccgg 
3421 actgcccrca gacttcaaga 
3481 gagcagacac cagcagccct 
3 541 cacacccagg cccgcaccgc 
3 601 catgtccggc tgaaggctga 
3661 gagtgtccag cacacctgcc 
3 721 gggccagcrt: ttcctcacca 
3781 ccagactcgc cattgtccac 
3 841 aggcggagac cctgagaagg 
3 901 ccctgcacac aggcgaggac 
3 961 gaggtgccgn gggagtaaaa 



FIGURE 16 - 
page2 
(Seq. ID. No. 1) 

acatgcgtcg caaactcttt: 
tgcaggtgaa cagcccccag 
cgtacaggtt tcacgcatgt 
ccacactttt cctgcgcgtc 
ccaagaacgc agggatgtcg 
ccgtgcagtg gctgtgccac 
acgtgccacc cctggggtca 
ggacgacgcr gactgccctg 
ccatcctgga ctgatggcca 
gtcacgccgg gctctacgtc 
tgggagcctg aggcctgagt 
gtgtccggct gaggcctgag 
gtcttcactt ccccacaggc 
ggagcccggc ttccactccc 
ccctcgccct gcccrccrrt: 
accctgggag ctctgggaat 
cctgcaccng gatgggggtc 
tactgaatat atgagrtrtt 



ggggccttgc ggctgaagtg 
acggtgtgca ccaacatcta 
gtgctgcagc tcccatttca 
atctctgaca cggcctccct 
ctgggggcca agggcgccgc 
caagcatitcc tgctcaagct 
ctcaggacag cccagacgca 
gaggccgcag ccaacccggc 
cccgcccaca gccaggccga 
ccagggaggg aggggcggcc 
gagtgtttgg ccgaggccng 
cgagtgtcca gccaagggct 
tSTSfcgcccgg ctccacccca 
cacataggaa tagtccatcc 
gccttccacc cccaccatcc 
ttggagtgac caaaggcgtg 
cctgcgggnc aaantggggg 
cagttttgaa aaaaa 



FIGURE 17 
HUMAN TRT PROTEIN SEQUENCE 
(SEQ. NO. 2) 

MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 
AAFRALVAQCLVCVPWDARPPPAAPSFRQVSCLKELVARVLQRLCERGAKNVLAFGFA 
LLDGARGGPPEAFTTSVRSYLPNTVTDALRGSGAWGLLLRRVGDDVLVHLLARCALFV 
LVAPSCAYQVCGPPLYQLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPA 
PGARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRGPSDRGFCVVSPA 
RPAEEATSLEGALSGTRHSHPSVGRQHHAGPPSTSRPPRPWDTPCPPVYAETKHFLYS 
SGDKEQLRPSFLLSSLRPSLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPL 
FLELLGNHAQCPYGVLLKTHCPLRAAVTPAAGVCAREKPQGSVAAPEEEDTDPRRLVQ 
LLRQHSSPWQVYGFVRACLRRLVPPGLWGSRHNERRFLRNTKKFISLGKHAKLSLQEL 
TWKMSVRDCAWLRRSPGVGCVPAAEHRLREEILAKFLHWLMSVYVVELLRSFFYVTET 
TFQKNRLFFYRKSVWSKLQSIGIRQHLKRVQLRELSEAEVRQHREARPALLTSRLRFI 
PKPDGLRPIVNMDYVVGARTFRREKRAERLTSRVKALFSVLNYERARRPGLLGASVLG 
LDDIHRAWRTFVLRVRAQDPPPELYFVKVDVTGAYDTIPQDRLTEVIASIIKPQNTYC 
VRRYAVVQKAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVIEQSSSL 
NEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSILSTLLCSLCYGDMENKLFAG 
IRRDGLLLRLVDDFLLVTPHLTHAKTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEAL 
GGTAFVQMPAHGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGRNMRR 
KLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRFHACVLQLPFHQQVWKNPT 
FFLRVISDTASLCYSILKAKNAGMSLGAKGAAGPLPSEAVQWLCHQAFLLKLTRHRVT 
YVPLLGSLRTAQTQLSRKLPGTTLTALEAAANPALPSDFKTILD 



FIGURE 18 

Clone 712562 
(SEQ ID NO. 3 ) 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 

TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 

GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 

GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 

CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 

AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 

GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 

GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGA 

CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 

CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 

GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 

CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 

GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 

TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 

CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 

AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 

TGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 

TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 

GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 

GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 

ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 

ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 

TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 

CCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 

TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 

AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 

CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 

AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 

CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 

GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 

TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 

AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 

CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 

CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 

GCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 

GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAA 

AAAAAAAAAAAAAAAA 



FIGURE 19 



SEQ ID NO. 10 
MetSerValTyrValValGluLeuLeuArgSerPhePhe 

TyrValThrGluThrThrPheGlnLysAsnArgLeuPhePheTyrArgLysSerValTrp 
SerLysLeuGlnSerIleGlyIleArgGlnHisLeuLysArgValGlnLeuArgGluLeu 
SerGluAlaGluValArgGlnHisArgGluAlaArgProAlaLeuLeuThrSerArgLeu 
ArgPheIleProLysProAspGlyLeuArgProIleValAsnMetAspTyrValValGly 
AlaArgThrPheArgArgGluLysArgAlaGluArgLeuThrSerArgValLysAlaLeu 
PheSerValLeuAsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAlaSerValLeu 
GlyLeuAspAspIleHisArgAlaTrpArgThrPheValLeuArgValArgAlaGlnAsp 
ProProProGluLeuTyrPheValLysValAspValThrGlyAlaTyrAspThrIlePro 
GlnAspArgLeuThrGluValIleAlaSerIleIleLysProGlnAsnThrTyrCysVal 
ArgArgTyrAlaValValGlnLysAlaAlaHisGlyHisValArgLysAlaPheLysSer 
HisValLeuArgProValProGlyAspProAlaGlyLeuHisProLeuHisAlaAlaLeu 
GlnProValLeuArgArgHisGlyGluGlnAlaValCysGlyAspSerAlaGlyArgAla 
AlaProAlaPheGlyGly 
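
The correspondence between the clone 712562 nucleotide sequence (Figure 18) and the three-letter translation above can be spot-checked with a short script. The Python sketch below is a minimal illustration, assuming the standard genetic code and assuming that translation starts at the first ATG in the fragment; the function name `translate_three_letter` is hypothetical, and the input is only the first line of Figure 18 as printed.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_1)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}

# First line of the clone 712562 sequence as printed in Figure 18.
FIG18_LINE1 = "GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT"


def translate_three_letter(dna: str) -> str:
    """Translate from the first ATG (an assumption of this sketch) until a
    stop codon or the last complete codon; input must contain only A/C/G/T."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start < 0:
        return ""
    residues = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":          # stop codon ends the open reading frame
            break
        residues.append(THREE_LETTER[aa])
    return "".join(residues)


print(translate_three_letter(FIG18_LINE1))
# MetSerValTyrValValGluLeuLeuArgSerPhe
```

The twelve residues printed agree with the start of the Figure 19 translation; the final two nucleotides of the line form an incomplete codon and are not translated.
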



SEQUENCE NO. 4 (DNA) AND SEQUENCE NO. 5 (PROTEIN) 
(TRANSLATION OF Δ182 hTRT VARIANT) 



1 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gln gly trp arg leu val gln arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 

phe arg ala leu val ala gln cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gln val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gln arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 

ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 
tyr leu pro 
TAC CTG CCC 



trp gly leu 
TGG GGG CTG 



leu leu ala 
CTG CTG GCA 



ala tyr gln 
GCC TAC CAG 



thr gln ala 
ACT CAG GCC 



leu gly cys 
CTG GGA TGC 



val pro leu 
GTC CCC CTG 



ser ala ser 
AGT GCC AGC 



ala ala pro 
GCT GCC CCT 



ala his pro 
GCC CAC CCG 



asn thr val 
AAC ACG GTG 

140 

leu leu arg 
CTG CTG CGC 



arg cys ala 
CGC TGC GCG 

170 

val cys gly 
GTG TGC GGG 



arg pro pro 
CGG CCC CCG 

200 

glu arg ala 
GAA CGG GCC 



gly leu pro 
GGC CTG CCA 

230 

arg ser leu 
CGA AGT CTG 



glu pro glu 
GAG CCG GAG 

260 

gly arg thr 
GGC AGG ACG 



13 0 

thr asp ala 
ACC GAC GCA 

arg val gly 
CGC GTG GGC 

160 

leu phe val 
CTC TTT GTG 

pro pro leu 
CCG CCG CTG 

190 

pro his ala 
CCA CAC GCT 

trp asn his 
TGG AAC CAT 

220 

ala pro gly 
GCC CCG GGT 

pro leu pro 
CCG TTG CCC 

250 

arg thr pro 
CGG ACG CCC 



arg gly P r ° 

CGT GGA CCG 



leu arg gly 
CTG CGG GGG 



asp asp val 
GAC GAC GTG 



leu val ala 
CTG GTG GCT 



tyr gin leu 
TAC CAG CTC 



ser gly pro 
AGT GGA CCC 



ser val arg 
AGC GTC AGG 



ala arg arg 
GCG AGG AGG 



lys arg pro 
AAG AGG CCC 



val gly gin 
GTT GGG CAG 



ser asp arg 
AGT GAC CGT 



ser gly ala 
AGC GGG GCG 

150 

leu val his 
CTG GTT CAC 



pro ser cys 
CCC AGC TGC 

180 

gly ala ala 
GGC GCT GCC 



arg arg arg 
CGA AGG CGT 

210 

glu ala gly 
GAG GCC GGG 



arg gly gly 
CGC GGG GGC 

240 

arg arg gly 
AGG CGT GGC 



gly ser trp 
GGG TCC TGG 

270 

gly phe cys 
GGT TTC TGT 




FIGURE 20 
Page 3 

280 
val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 
gln his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 
leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 
glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 
leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 



430 
ala ala gly val cys ala arg glu lys pro gln gly ser val ala 
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG 

440 450 
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu 
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG 

460 
leu arg gln his ser ser pro trp gln val tyr gly phe val arg 
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG 

470 480 
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg 
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG 

490 
his asn glu arg arg phe leu arg asn thr lys lys phe ile ser 
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC 

500 510 
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys 
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG 

520 
met ser val arg asp cys ala trp leu arg arg ser pro gly val 
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT 

530 540 
gly cys val pro ala ala glu his arg leu arg glu glu ile leu 
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG 

550 
ala lys phe leu his trp leu met ser val tyr val val glu leu 
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG 

560 570 
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn 
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC 

580 
arg leu phe phe tyr arg lys ser val trp ser lys leu gln ser 
AGG CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC 



590 600 
ile gly ile arg gln his leu lys arg val gln leu arg glu leu 
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG 

610 
ser glu ala glu val arg gln his arg glu ala arg pro ala leu 
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG 

620 630 
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg 
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG 

640 
pro ile val asn met asp tyr val val gly ala arg thr phe arg 
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC 

650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 
phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 
thr phe val leu arg val arg ala gln asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 
gln asp arg leu thr glu val ile ala ser ile ile lys pro gln 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 
his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 
gln pro val leu arg arg his gly glu gln ala val cys gly asp 
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 

ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 
GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 
GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 
CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 
GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 
CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 
ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 
CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 
CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 
GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 
GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 
GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 



GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 
TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 
ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 
CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 
CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 
GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 
CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 
CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 
CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 
TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 
GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 
AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
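
The codon-by-codon translation printed above can be checked mechanically. The following is a minimal sketch, assuming the standard genetic code and the lower-case three-letter residue labels used in the figure; the function name `translate_codons` is hypothetical, and the stop labels "OC" and "AM" are assumed by analogy with the "OP" (opal, TGA) label that appears at the end of the open reading frame.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
ONE_LETTER = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), ONE_LETTER)}

THREE_LETTER = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
}

# "OP" is the label used in the figure; "OC" and "AM" are assumptions.
STOP_LABEL = {"TGA": "OP", "TAA": "OC", "TAG": "AM"}


def translate_codons(dna):
    """Translate an in-frame DNA string into three-letter residue labels.

    Whitespace between codons is ignored; translation ends at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if codon in STOP_LABEL:
            residues.append(STOP_LABEL[codon])
            break
        residues.append(THREE_LETTER[CODON_TABLE[codon]])
    return residues


if __name__ == "__main__":
    # First codons of the open reading frame as printed in the figure.
    print(" ".join(translate_codons("ATG CCG CGC GCT CCC CGC TGC")))
```

Running a row of the figure through such a function is one way to spot transcription errors, for example a codon whose printed residue label does not match its bases.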



FIGURE 21 
Genomic DNA insert of pGRN144 

Seq. ID. No. 6 

1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

61 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



**************************************** 

301 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



361 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
ACGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



481 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 



CCAAT 
***************> 

661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT 
CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

721 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 

CAP 

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGACGTGGGTATTATGACCCCACAGAAGACCCATAGTCGCTAGAAGTAACTT 

CAP 

************ 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



***> 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGAAGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

961 AACCAGTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

1201 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGGAAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCAAAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 

1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

1801 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

1861 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Sp1 
******** 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGCCCGT 

Sp1 



E2F 
******** 

2281 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 



Sp1 



E2F NFkB 
********* ********* ******************** 

2401 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 

hTRTS 1 

*****************************> 
2461 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 

E2F 
******* 

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



* 

2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



********************************************** 

2701 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_II_cleavage site 

NFkB 
++++++++++ 
NFkB 

Intron1 

*********************************************************> 
2761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTCCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 



2821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 
GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 

3001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 
GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCACGGTCGGCTTCAGACGGCAA 

3301 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 

3361 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3601 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCCTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 
CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 
GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 

4081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



**************************** 

4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCCTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

************************************************************ 

4201 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



************************************************************ 
4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



*****> 
4321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 
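
Each numbered row of the genomic insert above pairs a top strand with its base-by-base complement printed directly beneath it. As a minimal sketch of how such a pair could be checked for transcription errors (the helper names `complement`, `reverse_complement`, and `mismatches` are hypothetical, and only the four unambiguous bases A, C, G, T are assumed):

```python
# Translation table mapping each base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand):
    """Return the base-by-base complement, in the same left-to-right order."""
    return strand.upper().translate(COMPLEMENT)


def reverse_complement(strand):
    """Return the complementary strand read 5' to 3'."""
    return complement(strand)[::-1]


def mismatches(top, bottom):
    """Return 1-based positions where `bottom` is not the complement of `top`."""
    expected = complement(top)
    return [i for i, (e, b) in enumerate(zip(expected, bottom.upper()), start=1) if e != b]


if __name__ == "__main__":
    # Last row of the insert (positions 4321 onward) as printed above.
    top = "GTGATCGAGGTCGAC"
    bottom = "CACTAGCTCCAGCTG"
    print(mismatches(top, bottom))      # an empty list means the rows agree
    print(reverse_complement(top))
```

A non-empty result from `mismatches` points at the positions in a row where one of the two printed strands was probably misread.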



FIGURE 23 

EST AA281296 
(Seq. ID. No. 3) 



gc caagttcctg cactggctga tgagtgtgta cgtcgtcgag ctgctcaggt ctttctttta 
tgtcacggag accacgtttc aaaagaacag gctctttttc taccggaaga gtgtctggag 
caagttgcaa agcattggaa tcagacagca cttgaagagg gtgcagctgc gggacgtgtc 
ggaagcagag gtcaggcagc atcgggaagc caggcccgcc ctgctgacgt ccagactccg 
cttcatcccc aagcctgacg ggctgcggcc gattgtgaac atggactacg tcgtgggagc 
cagaacgttc cgcagagaaa agagggccga gcgtctcacc tcgagggtga aggcactgtt 
cagcgtgctc aactacgagc gggcgcg 



FIGURE 24 
(Seq. ID. No. 9) 

TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 

ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGC 

CAGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCAT 
CAGGGGCAAGTC 

[Figure residue: labels pBB5212 and pGRN133; Approximate Cell No.: 5,000, 5,000, 5,000, 5,000; Panel A and Panel B] 

[Figure residue: schematic of the telomerase template CCAAAACCCCAAAAC; Panel A and Panel B] 

Figure 34 

1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 
51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 
101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 
151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 
201 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAACAATTAG AGTTCTACTT 
251 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 
301 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 
351 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 
401 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 
451 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 
501 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 
551 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 
601 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 
701 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 
801 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 
851 TTGGATTCTT AAAGCATAGA TACACAGAAT GCTTTAGAGA CTGATTTAGC 
901 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 
1001 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 
1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT CAAAAATTCC 
1101 AGGTAAGAGA GATACATTCA TTAAAATTCA TATATTATAG TTTTTCATTT 
1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 
1201 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 
1251 TCACATTCAT AGATCGACCT TCATATATCC AATACGATGA TAAGGAAACA 
1301 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 
1351 AAGAAATGGA GCCGAAATCT TAATCAAAAA GAATTGCGTC GATATTGCAA 
1401 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 
1451 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 
1501 GTAACTTTTA TTAATTAGAG AATAAACTAA ATTACTAATA TAGAGATCAG 
1551 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 
1601 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 



II 



CCCCAAAACCCCAAAACCCCAAAACCCCTATAAAAAAAGAAAAAATTGAGGTAGTTTAGA 
1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTCTTTTTTAACTCCATCAAATCT 

a PQNPKTPKPL*KKKKLR*FR 

b PKTPKPQNPYKKRKN*GSLE- 

c PKPQNPKTP I KKEKI EVV * K- 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + + + + + 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI * KI 

b IKYYSRTNGDGY*FG*YRKF- 

c * N I I PAQMEMDIDLDDIENL- 

TACTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a YFLIHSTS IAALVVTRKDAK 

b TS*YIQQV*QLL**QERMQN- 

c LPNTFNKYS SSC SDKKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b IEIWLEIAFIDYSKVAKTIR- 

c LKSGSKSPSLTIPKLQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + + + 300 

TCAAGATGAAGAGCCTACGTTTAGAAATATTGCTAAGAAAGAACTCTTTTAATCAAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RFFLEKISFK- 

C FYF SDANLYND S F LRKLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + + + + 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE* KLKHY*CLNKIR* 

b KRRAKSRN*NITNV*IKSGN- 

c SGEQRVE I ETLLMFK * NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

361 + + + + + + 420 

ACTCCTAATAAGATAAAAAATCTAGTGAAGAATTCCTCGTAATACCTCTTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT* 

b EDYSIF*ITS*GALWRKLLN- 

c RIILFFRSLLKEHYGENYLI- 

TACTAAAAGGTAAACAGTTTGGATTATTTCCCTAGCCAACAATGATGAGTATATTAAATT 

421 + + + + + + 480 

ATGATTTTCCATTTGTCAAACCTAATAAAGGGATCGGTTGTTACTACTCATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*I 

b TKR*TVWI ISLANNDEYIKF- 

c LKGKQFGLFP * PTMMS I LNS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGACAAACTCGCTAT 

481 + + + + + + 540 

GTATACTCTTACTCAGTTTCCTAGAGCTATGTAGTCTGAATGGTTTCTGTTTGAGCGATA 

a HMRMSQRIS IHQTYQRQTRY 

b I*E*VKGSRYIRLTKDKLAI- 

c YENESKDLDTSDLPKTNSL * - 

AAAACGCAAGAAAAAGTTTGATAATCGAACAGCAGAAGAACTTATTGCATTTACTATTCG 

541 + + + + + + 600 

TTTTGCGTTCTTTTTCAAACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a KTQEKV* *SNSRRTYCIYYS 

b KRKKKFDNRTAEELIAFTI R- 

c NARKSLI IEQQKNLLHLLFV- 

TATGGGTTTTATTACAATTGTTTTAGGTATCGACGGTGAACTCCCGAGTCTTGAGACAAT 

601 + + + + + + 660 

ATACCCAAAATAATGTTAACAAAATCCATAGCTGCCACTTGAGGGCTCAGAACTCTGTTA 

a YGFYYNCFRYRR*TPES*DN 

b MGFITIVLGIDGELPSLETI- 

c WVLLQLF *VSTVNSRVLRQL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 + + + + + + 720 

ACTTTTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a *KSCLQLKESQF*KF*CVCH 

b EKAVYN*RNRSSESSDVYAI- 

c KKLFTTEGIAVLKVLMCMPL- 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

721 + + + + + + 780 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a YFVN*SQISYLNLMDSYRNK 

b IL*INLKYLISI*WIAIETN- 

c FCELISNILSQFNG*L*KQT- 

CCAAATAAACCATGCAAGTTTAATGGAATATACGTTAAATCCTTTGGGACAAATGCACAC 

781 + + + + + + 840 

GGTTTATTTGGTACGTTCAAATTACCTTATATGCAATTTAGGAAACCCTGTTTACGTGTG 

a PNKPCKFNGIYVKSFGTNAH 

b QINHASLMEYTLNPLGQMHT- 

c K * TMQV*WNI R * I LWDKCTL- 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 + + + + + + 900 

ACTTAAATATAACCTAAGAATTTCGTATCTATGTGTCTTACGAAATCTCTGACTAAATCG 

a * IYIGFLKHRYTECFRD* FS 

b EFILDS*SIDTQNALETDLA- 

c NLYWILKA*IHRML*RLI*L- 

TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 
901 + + + + + + 960 

AATGTTGTCTAATGGACAAAACTAATGAGAACGAGTAGAGAATATAGAAATTTTCTTCGT 

a LQQITCFDYSCSSLISLKEA 

b YNRL PVLITLAHLLYL * KKQ - 

c TTDYLF * LLLLISYIFKRSR- 

GGCGAAATGAAAAGAAGACTAAAGAAAGAGATTTCAAAATTTGTTGATTCTTCTGTAACC 
961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKKEI SKFVDSSVT 

b AK*KED*RKRFQNLLILL*P- 

c RNEKKTKERDFKIC * FFCNR- 

GGAATTAACAACAAGAATATTAGCAACGAAAAAGAAGAAGAGCTATCACAATCCTGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS * F 

b ELTTRILATKKKKSYHNPDS- 

c N * QQEY *QRKRRRAITI LIL- 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIKIHIL* 

b *RFQKFQVREIHSLKFIYYS- 

c KDFKNSR*ERYIH*NSYIIV- 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + + + 1200 

AAAAAGTAAAGTGTCGACAATAAAAGAAAATAGAATTGTTATAAAAAACTAATCGACCTT 

a FFISQLLFSFILTIFFD*LE 

b FSFHSCYFLLS *QYFLISWK 

C FHFTAVIFFYLNNIF* LAGS- 

GTAAAAAGTATCAAATAAGAGAAGCGCTAGACTGAGGTAACTTAGCTTATTCACATTCAT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT*LIHIH 

b *KVSNKRSARLR*LSLFTFI- 

c KKYQIREALD*GNLAYSHS *- 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 

b DRPSYIQYDDKETAVIRFKN- 

c IDLHISNTMIRKQQSSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 
1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a *CYED*IFRVKKWSRNLNQK 

b SAMRTKFLESRNGAEILIKK- 

c VL*GLNF*SQEMEPKS*SKR- 

GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA 

1381 + + + + + + 1440 

CTTAACGCAGCTATAACGTTTTCTTAGCTTGAGATTTAGAAAGCAATTATTCATAATGGT 



a ELRRYCKRIEL*IFR**VLP 

b NCVDIAKESNSKSFVNKYYQ- 

c IASILQKNRTLNLSLISITN- 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 
1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILID*RD*RGNCTEDH*RNK 

b S*LIEEIDEATAQKIIKEIK- 

c L D * LKRLTRQLHRRSLKK * S - 

GTAACTTTTATTAATTAGAGAATAAACTAAATTACTAATATAGAGATCAGCGATCTTCAA 
1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

c NFY*LENKLNY*YRDQRSS I - 

TTGACGAAATAAAAGCTGAACTAAAGTTAGACAATAAAAAATACAAACCTTGGTCAAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK*KLN*S*TIKNTNLGQN 

b *RNKS*TKVRQ*KIQTLVKI- 

c DEIKAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEGKEDQLAKEKIRQ* IK* 

b LRKEKKTS *QKKK*GNK * N E - 

c * GRKRRPVSKRKNKAI NKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 
1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCCCCAA 

a VQK * RNKRFIFFNNLLKRGV 

b YRSEEIKDLFFSIIY*KEGF- 

c TEVKK*KIYFFQ*FIEKRGF- 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

a LGFWGFG 
b WGFGVLG 
c GVLGFW 
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Figure 42 



1 aacicama artactaart taatcaacaa gattgataaa aagcagtaaa taaaacccaa 
61 tagatttaat ttagaaagta tcaattgaaa aatggaaatt gaaaacaact aagcacaata 
121 gccaaaagcc gaaaaangt ggtgggaact tgaattagag atgcaagaaa accaaaatga 
181 tatataagtt agggttaaga ttgacgatcc taagcaatat ctcgtgaacg tcactgcagc 
241 atgtttgrtg taggaaggta gttactacta agataaagat gaaagaagat atatcatcac 
301 taaagcactt cngaggtgg ctgagtctga tcctgagttc atctgctagt tggcagtcta 
361 catccgiaat gaactttaca tcagaactac cactaactac attgtagcat tttgtgttgt 
421 ccacaagaat actcaaccat tcatcgaaaa gtacttcaac aaagcagtac ttttgcctaa 
481 tgacttactg gaagtctgtg aatngcata ggttctctat atttttgarg caactgaatt 
541 caaaaatng tatcttgata ggatactttc ataagatatt cgtaaggaac tcactctccg 
601 taagigttta caaagatgcg tcagaagcaa gtmctgaa ttcaacgaat actaacttgg 
661 taagtattgc actgaatcct aacgtaagaa aacaatgttc cgttacctct cagttaccaa 
721 caagtaaaag tgggattaaa ctaagaagaa gagaaaagag aatctcrtaa ccaaacrrta 
781 ggcaataaag gaatctgaag ataagtccaa gagagaaact ggagacataa tgaacgrtga 
841 agatgcaatc aaggctrtaa aaccagcagt tatgaagaaa atagccaaga gatagaatgc 
901 catgaagaaa cacatgaagg cacctaaaat tcctaactct accrtggaat caaagtactt 
961 gaccttcaag gatctcatta agttctgcca tatttctgag cccaaagaaa gagtctataa 
1021 gatccnggt aaaaaatacc ctaagaccga agaggaatac aaagcagcct ttggtgattc 
1081 tgcatctgca cccttcaatc ctgaattggc tggaaagcgt atgaagattg aaatctctaa 
1141 aacatgggaa aatgaactca gtgcaaaagg caacactgct gaggtttggg ataatttaat 
1201 ttcaagcaat taactcccat atatggccat gttacgtaac ttgtctaaca tcttaaaagc 
1261 cggtgtttca gatactacac actctartgt gatcaacaag arttgtgagc ccaaggccgt 
1321 tgagaactcc aagatgrtcc ctcttcaatt ctttagtgcc attgaagctg ttaatgaagc 
1381 agttactaag ggattcaagg ccaagaagag agaaaatatg aatcttaaag gtcaaatcga 
1441 agcagtaaag gaagttgttg aaaaaaccga tgaagagaag aaagatatgg agttggagta 
1501 aaccgaagaa ggagaacttg rtaaagtcaa cgaaggaan ggcaagcaar acattaactc 
1561 cangaactt gcaatcaaga tagcagttaa caagaattta gatgaaatca- aaggacacac 
1621 tgcaatctic tctgatgttt ccggnctat gagtacctca atgtcaggtg gagccaagaa 
1681 gtatggncc gncgtactt gtctcgagtg tgcanagtc cttggtnga tggtaaaata 
1741 acgngtgaa aagtcctcat tctacatcrt cagttcacct agrtctcaat gcaaraagtg 
1801 rracrtagaa gngatctcc crggagacga actccgtcct tctatgtaaa aacttttgca 
1861 agagaaagga aaacnggtg gtggtactga mcccctat gagtgcang atgaatggac 
1921 aaagaataaa actcacgtag acaatatcgt tamtgtct gatatgatga ttgcagaagg 
1981 atattcagat atcaatgtta gaggcagnc cattgttaac agcatcaaaa agtacaagga 
2041 tgaagtaaat cctaacatta aaatctngc agttgactta gaaggttacg gaaagtgcct 
2101 taatctaggt gatgagttca atgaaaacaa ctacatcaag atattcggta tgagcgattc 
2161 aatcttaaag rtcatrtcag ccaagcaagg aggagcaaat atggtcgaag ttatcaaaaa 
2221 ctrtgccctt caaaaaatag gacaaaagtg agtttcttga gattcttcta taacaaaaat 
2281 ctcaccccac trrtttgttt tattgcatag ccattatgaa atrtaaana ttatctattt 
2341 atttaagtta cttacatagt ttatgtatcg cagtctatta gcctattcaa atganctgc 
2401 aaagaacaaa aaagactaaa a 

MEIENNQAQQPKAEKLWWELELEMQENQNDIQVRVKIDDPKQYL 

WVTAACLLQEGSYYQDKDERRYIITKALLEVAESDPEFICQLAVYIR>mLYIRTTTN 

YIVAFCWHKNTQPF!EKYFNKAVLLPNDLLEVCEFAQVLYIFDATEFK>JLYLDRILS 

QDIRi<£LTFRKCLQRCVRS(<JSEFNEYQLGKYCTESQRKKTMFRYLSVThnCQKWDQTK 

KKRK£NLLT!aQAlK£SEDKSKilETGDIMNVEDAIKA^ 

APKIPNSTLESKYLTFKDLIKFCHISEPKERVYKILGKKYPKTEEEYKAAFGDSASAP 
FNPELAGKJlM]aEISKT\VENELSAKGNTAEVV^NLISSNQLPYMAMLRNLSNILKAGV 

SDTTHSrVTNKJCEPKAVENSKMFPLQFFSAIEAVNEAVT^ 

AVKEVVEKTDEEKKJDMELEQTEEGEFVKVNEGlGKQYINSIELAIKIAVNKhlLDEIKG 
HTAiFSDVSGSMSTSMSGGAKKYGSVRTCLECALVLGL^4VKQRCEKSSFYlFSSPSSQ 
CNKCYLE^LPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHVDNIVILSD 
N^IAEGYSDr>A/RGSSIVNSIKiCY^DEVNPNIKJFAVDLEGYGKCLhrLGDEFNE>f>m 

KIFGMSDSILKFISAKQGGANMVEVIKNFALQKJGQK 
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1 tcaalactat taattaataa ataaaaaaaa gcaaactaca aagaaaatgt caaggcgtaa 
61 ctaaaaaaag ccataggctc ctataggcaa tgaaacaaat cttgattttg tatlacaaaa 
121 tctagaagtt tacaaaagcc agattgagca ttataagacc tagtagtaat agatcaaaga 
181 ggaggatctc aagcirttaa agttcaaaaa ttaagattag gatggaaact ctggcaacga 
241 tgatgatgat gaagaaaaca actcaaataa ataataagaa ttattaagga gagtcaatta 
301 gattaagtag caagtrtaat tgataaaaaa agttggttct aaggtagaga aagatttgaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg actttctgaa tagcaagtga aagaagagta 
421 attaagaacg artactgaag aataggttaa gtanaaaat ttagtattta acatggacta 
481 ccagttagat ttaaatgaga gtggtggcca tagaagacac agaagagaaa cagattatga 
541 tactgaaaaa tggtttgaaa tatctcatga ccaaaaaaat tatgtatcaa tttacgccaa 
601 ctaaaagaca tcatattgtt ggtggcttaa agattatttt aataaaaaca attatgatca 
661 tcttaatgta agcartaaca gactagaaac tgaagccgaa ttctatgcct ttgatgattt 
721 ttcacaaaca atcaaactta ciaataanc ttactagact gttaacatag acgttaattt 
781 tgataaiaat ctctgtatac tcgcattgct tagattttta ttatcactag aaagattcaa 
841 tattttgaat ataagatcti cttatacaag aaattaatat aattttgaga aaattggtga 
901 gctacttgaa actatcttcg cagttgtctt ttctcatcgc cacttacaag gcattcattt 
961 acaagncct tgcgaagcgt tctaatarn agttaactcc tcatcataaa ttagcgttaa 
1021 agatagctaa ttataggtat actcrttctc tacagactta aaattagttg acactaacaa 
1081 agtccaagat tattttaagt tcrtaiaaga attccctcgt ttgactcatg taagctagta 
1141 ggctatccca gttagtgcta ctaacgctgt agagaacctc aatgttttac ttaaaaaggt 
1201 caagcatgct aatcrtaan tagmctat ccctacctaa ncaatrttg atttctactt 
1261 tgttaattta taacatttga aartagagtt tggattagaa cc222.ta.ttt igacaaaaca 
1321 aaagctrgaa aatctactn tgagtataaa ataatcaaaa aatcttaaat ttttaagatt 
1381 aaacttrtac acctacgttg cttaagaaac ctccagaaaa cagatattaa aacaagctac 
1441 aacaatcaaa aatctcaaaa acaataaaaa tcaagaagaa actcctgaaa ctaaagatga 
1501 aactccaagc gaaagcacaa gtggtatgaa attttttgat catctttctg aattaaccga 
1561 gcttgaagat rtcagcgtia acttgtaagc tacccaagaa atttatgata gcrtgcacaa 
1621 acrtngatt agatcaacaa arnaaagaa gncaaana agttacaaat atgaaatgga 
1681 aaagagtaaa atggatacat tcatagatct taagaatatt tatgaaacct taaacaatct 
1741 taaaagatgc tctgnaata tatcaaatcc tcatggaaac atttcttatg aactgacaaa 
1801 taaaganct acttmata aamaagct gaccnaaac taagaattat aacacgctaa 
1861 gtatacnn aagtagaacg aatmaan taataacgn aaaagtgcaa aaartgaatc 
1921 ncctcatta gaaagcrtag aagatatiga cagictngc aaatctattg cttcttgtaa 
1981 aaamacaa aatgttaata rtatcgccag mgctctat cccaacaata tttagaaaaa 
2041 tcctncaat aagcccaatc ttctattm caagcaam gaataattga aaaatttgga 
2101 aaatgtatct atcaactgta ctcngatca gcatatacn aattctam cagaattctt 
2161 agaaaagaat aaaaaaataa aagcattcat tttgaaaaga tattatttat tacaatatta 
2221 tcttganat actaaartat ttaaaacact tcaatagna cctgaattaa attaagttta 
2281 cattaattag caattagaag aattgactgt gagtgaagta cataagtaag tatgggaaaa 
2341 ccacaagcaa aaagctttct atgaaccan atgtgagttt atcaaagaat catcctaaac 
2401 ccrttagcta atagatrng accaaaacac tgtaagtgat gactctatta aaaagatttt 
2461 agaatctata tctgagtcta agtatcatca ttatngaga ttgaacccta gttaatctag 
2521 cagtnaatt aaatctgaaa acgaagaaat ttaagaactt ctcaaagctt gcgacgaaaa 
2581 aggtgtttta gtaaaagcat actataaati ccctctatgt ttaccaactg gtacttatta 
2641 cgattacaat icagatagat ggtganaat taaatattag tttaaataaa tartaaatat 
2701 tgaatatnc tngcttan amgaaiaa tacatacaat agtcattttt agtgttnga 
2761 atatatma gttamaat tcanatm aagtaaataa ttattmca atcattmt 
2821 aaaaaatcg 
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MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 

EEDLKLLKFKNQDQDCNSGNDDDDEENNSNKQQELLRRVNQIKQQVQLIKKVGSKVEK 

DLNLNEDENKKNGLSEQQVKJEEQLRT1TEEQVKYQNLVFNMDYQLDLNESGGHRRHRR 

ETDYDTEKWFEISHDQKNYVSIYANQKTSYCWWLKI)YFNKNhr/DHLhrVSINRLETEAE 

FYAFDDFSQTIKJLTWJSYQTVNIDVNFDNNLCILALLRPLLSLERFNILNIRSSYTRN 

QYNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQVYSF 

STDLKJLVDTNKVQDYFK^LQEFPRLTHVSQQAIPVSATNAVE^hm-LKKVKHANLNL 

VSlPTQFNFDFYFVNLQHLiCLEFGLEPNILTKQKJLENLLLSIKQSKNLKFLRLNFYTY 

VAQETSRKQILKQATT1K>ILKNNKNQEETPETKDETPSESTSGMKFFDHLSELTELED 

FSVNLQATQEIYDSLHKLURSTNLKiCFKLSYKYEMEKSf^ 

RCSVNISNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQ^FQFNhr/KSAKJE 

SSSLESLEDIDSLCKSIASCICNLQNVNIIASLLYPNNIQKNPFNKPNLLFFKQFEQLK 

NLEm'SINCILDQfflLNSISEFLEKNKJCIKAFILKRYYLLQYYLDYTKLFKTLQQLPE 

LNQVY[NOQLEELTVSEVHKQVWENHKQKLAFYEPLCEFIfCESSQTLQLIDFDQNTVSD 

DSIKKJLESJSESKYHHYLRLNPSOSSSLIKSENEEIQELLKACDEKGVLVKAYYKFP 

LCLPTGTYYDYN'SDRW 
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MKJLFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 

PNSRK1ALPCLPGDLSHKAVIDHCIIYLLTGELYNNVLTFGYKIARNEDVNNSLFCHS 

ANV^TLLKGAAWKMFHSLVGTYAFVDLLINYTVIQFNGQFFTQIVGNRCNEPHLPPK 

VA/QRSSSSSATAAQIKQLTEPVTNKQFLHKLNFNSSSFFPYSKILPSSSSIKKLTDLR 

EAIFPTNLVKJPQRJLKVRJNLTLQKLLKJIHKRLNYVSILNSICPPLEGTVLDLSHLSR 

QSPKERVLKFHVlLQKLLPQEMFGSKKNKGKIIKhfLNLLLSLPLNGYLPFDSLLKKL 

RLKDFR^FISDIWFTKHHFENLNQLAICFISWLFRQLIPKJIQTFFYCTEISSTVTI 

VYFRHDTWNKLITPFIVEYFKTYIVENNVCRNHNSY^ 

IIAIPCRGADEEEFTIYK£NHKNAJQPTQKJLEYLRNKJIPTSFTKIYSPTQ1ADRJ!CE 

FKQRLLKKF>^LPELYFMKJ r DVKSCYDSIPRMECMRILKJDALKNENGFFVRSQYFFN 

TNTGVLKLFNVVNASRVPKPYELYIDKVRTVHLSNQDVINVVEh^IFKTALWVEDKCY 

IREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPSQDTLILPCLAJDDFLIISTDQQQV 

INIKKLAMGGFQKWAKANRDKJLAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMN 

NFHIRSKSSKGIFRSLIALFNTRJSYKTIDTNLNS7NTVLMQIDHVVKNISECYKSAF 

KDLSfNVTQNMQFHSFLQRUEMTVSGCPlTKCDPLIEYEVRFTILNGFLESLSSNTS 

KFKDNIILLRKEIQHLQA Yl YIYIHtVN 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 



# # 

Figure 48 



ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 
GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 
TCTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 
TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 
ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 
TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 
GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 
TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 
GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 
ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 
TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 
GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 
TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 
GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 
ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 
AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 
AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 
GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 
GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 
ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 
TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 
CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 
AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 
AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 
CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 
TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 
TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 
CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 
TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 
TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 
TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 
GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 
TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 
CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 
AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 
TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 
TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 
CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 
AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 
CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 
TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 
CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 
TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 
ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 

VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIVNMDYVVGARTFRREKR 

AERLTSRVKALFSVLNYERA 



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GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 

TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 

GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAA 

GAGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGA 

AGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACG 

GGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAACGTTCCG 

CAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGC 

GTGCTCAACTACGAGCGGGCGCG 



Figure 51 



MTEHHTPKSRJLRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHWGVQNDLV 
STFPNYLISILESKNWQLLLElIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKKNVFEETVSfCKRX 
RTIETSITQNKSARKEVSWNS1SISRFSIFYRSSYK 

LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYfDTHDDEKILS 

YSLKPNQVFAFLRSILVRVFPKLIWGNQRJFEnLIOJLETFLKLSRYESFSLHYLMSNlKISEIEWLVL 

GKilSNAKMCLSDFEKRKQEFAEFIYWLYNSFIIPILQSFFYITESSDLRNRTVYFRXDIW 

SMKMEAFEKINENNVRMDTQKTTLPPAVIRLLP^^ 

LRPVASILKHLWEESSCIPFNLEVYMKLLTFKJ<^^ 

IVKKJCLKDPEFVFRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRVVDDFLFITVNKKDAKKJLNLSLRGFEKJ^ 

FSVNMRSLDTLLACPfCIDEALFNSTSVELT?CHMGKSFFYKILRSSLASFAQVFIDITHNSKFNSCCNI 

YRLGYSMCMRAQAYLKRMKDIF1PQRMFITDLLNVIGRKIWKKJLAEILGYTSRRFLSSAEVKWLFC 

LGMRDGLfCPSFKYHPCFEOLIYQFQSLTDLIKPLRPVLRQVLFLHRRJAD 



Figure 52 

ggtaccgamacraccmcncataagctaattgcnccicgaacgctcctaaatctctggaaatamnacaagaactcaataacaataccaagtcaaanccaatatgaagg 

tgnanagtgatcgataatatnctatrrtatcggtcgnaccaagtataaggacaaaaagaacaacnccnccccctaaagacttttactnanaamactracaaatatamcg 

ggncgcnactmaaicgtggtactgtmagctgctacnctagccaaccgcgtgmctaccccgtcanggatatagctcnggagtagctcacagaaatccttacaaatcn 

ctgatgagactatanagancattacagtccgtgcatancnaacaiggagccttacactttagatgagtcacgtcgcatgatggagtatnggtatcatccaacgmgccng 

aaaaggngataattamgcaaaatcatgtccnagtggtggtaatccgcgaaagtttmgatgcngcacacgtctagcatgangagaiancaaaaamctatccactacaa 

ctccmaacgcggrmttmctatmctattctcatgngttccaaautgtatca^^ 

aaiaatctaaanagracgcnataangaagtagtagaaagattggtganctactcgtgtaatgnanagtttaaagatacragcaaaacamanagctatcanawtaaaa 
aaaatcctataanataaaiatiaatcaatatngcggtcaciatttatttaaaacgnatgatcagtaggacacmgcatatatatagnaigcnaatggttacttgtaacngcAT 
GACCGAACACCATACCCCCAAAAGCAGGATTCTTCGCTTTCTAGAGAATCAATATGTATACCTATGTA 

ccttaaatgattatgtacaacttgttttgagagggtcgccggcaagctcgtatagcaatatatgcgaa 
cgcttgagaagcgatgtacaaacgtccttttctatttttcttcattcgactgtagtcggcttcgacagt 

AAGCCAGATGAAGGTGTTCAATTTTCTTCTCCAAAATGCTCACAGTCAGAGgu^tatmigtntgatmntctancg 

ggatagctaatatatgggcagCTAATAGCGAATGTTGTAAAACAGATGTTCGATGAAAGTTTTGAGCGTCGAAGGA 

ATCTACTGATGAAAGGGTTTTCCATGgtaaggtanctaangtgaaatamacctgcaanactgrncaaagaga ngtattta accgaiaaagAA 

TCATGAAGATT-TTCGAGCCATGCATGTAAACGGA GTACA AAATGATCTCGTTTCTACTTTTCCTAATTA 

CCTTATATCTATACTTGAGTCAAAAAATTGGCAACTTTTGTTAGAAATgtaaataccggnaagatgngcgcacmgaaca 

aaacteacaaetatagTATCGGCAGTGATGCCATGCATTACTTATTATCCAAAGGAAGTATTTTTGAGGCTCTTC 

CAAATGACAATTACCTTCAGATTTCTGGCATACCACTTTTTAAAAATAATGTGTTTGAGGAAACTGTGT 

CAAAAAAAAGAAAGCGAACCATTGAAACATCCATTACTCAAAATAAAAGCGCCCGCAAAGAAGTTTC 

CTGGAATAGCATTTCAATTAGTAGGTTTAGCATTTTTTACAGGTCATCCTATAAGAAGTTTAAGCAAGgt 

aactaatactgnatccncataactaarmagATCTATATTTTAACTTACACTCTATTTGTGATCGGAACACAGTACACATG 

TGGCTTCAATGGATTTTTCCAAGGCAATTTGGACTTATAAACGCATTTCAAGTGAAGCAATTGCACAA 

AGTGATTCCACTGGTATCACAGAGTACAGTTGTGCCCAAACGTCTCCTAAAGGTATACCCTTTAATTGA 

ACAAACAGCAAAGCGACTCCATCGTATTTCTCTATCAAAAGTTTACAACCATTATTGCCCATATATTGA 

CACCCACGATGATGAAAAAATCCTTAGTTATTCCTTAAAGCCGAACCAGGTGTTTGCG-ITTCTTCGATC 

CATTCTTGTTCGAGTGTTTCCTAAATTAATCTGGGGTAACCAAAGGATATTTGAGATAATATTAAAAGg 

tangtataaaacnanaccactaacgartnaccagACCTCGAAACTTTCTTGAAATTATCGAGATACGAGTCTTTTAGTTTAC 

ATTATTTAATGAGTAACATAAAGgtaatatgccaaatrtttttaccartaanaacaatcagA I I l CAGAAATTGAATGGCTAGT 

CCTTGGAAAAAGGTCAAATGCGAAAATGTGCTTAAGTGATTTTGAGAAACGCAAGCAAATATTTGCGG 

A ATTCATCTACTGGCTATAC A ATTCGTTTATAATACCTATTTTACAATC 1 1 Hill I ATATC ACTGAATC 

AAGTGATTTACGAAATCGAACTGTTTATTTTAGAAAAGATATTTGGAAACTCTTGTGCCGACCCTTTAT 

TACATCAATGAAAATGGAAGCGTTTGAAAAAATAAACGAGgunttaaagtatnmgcaaaaagctaatattncagAACAA 

TGTTAGGATGGATACTCAGAAAACTACTTTGCCTCCAGCAGTTATTCGTCTATTACCTAAGAAGAATAC 

CTTTCGTCTCATTACGAATTTAAGAAAAAGATTCTTAATAAAGgtanaatmtggtcaicaatgtacmacnctaatctattanag 

caeATGGGTTCAAACAAAAAAATGTTAGTCAGTACGAACCAAACTTTACGACCTGTGGCATCGATACTG 

AAACATTTAATCAATGAAGAAAGTAGTGGTATTCCATTTAACTTGGAGGTTTACATGAAGCTTCTTACT 

TTTAAGAAGGATCTTCTTAAGCACCGAATGTTTGGgtaattatataatgcgcgancctcatianaatmgcagGCGTAAGAAG 

tattttgtacggatagatataaaatcctgttatgatcgaataaagcaagatttgatgtttcggattgtt 

AAAAAGAAACTCAAGGATCCCGAATTT GTAAT TCGAAAGTATGCAACCATACATGCAACAAGTGACCG 
AGCTACAAAAAACTTTGTTAGTGAGGCGTTTTCCTATTgtaagmatttmcanggaa 

GGTGCCTTTTGAAAAAGTCGTGCAGTTACTTTCTATGAAAACATCAGATACTTTGTTTGTTGATTTTGT 

GGATTATTGGACCAAAAGTTCTrCTGAAATTTTTAAAATGCTCAAGGAACATCTCTCTGGACACATTGT 

TAAGKtauccaangngaangmmcactaatgaaactagATAGGAAATTCTCAATACCTTCAAAAAGTTGGTATCCC'TC 

AGGGCTCAATTCTGTCATCTTTTTTGTGTCATTTCTATATGGAAGATTTGATTGATGAATACCTATCGTT 

TACGAAAAAGAAAGGATCAGTGTTGTTACGAGTAGTCGACGATTTCCTCTTTATAACAGTTAATAAAA 

AGGATGCAAAAAAATTTTTGAATTTATCTTTAAGAGgtgagngctgtcancciaagttctaaccgttgaagGATTTGAGAA 

acacaatttttctacgagcctggagaaaacagtaataaactttgaaaatagtaatgggataataaaca 
atactttttttaatgaaagcaagaaaagaatgccattcttcggtttctctgtgaacatgaggtctcttg 
atacattgttagcatgtcctaaaattgatgaagccttatttaactctacatctgtagagctgacgaaac 

ATATGGGGAAAT C1TI 1111 TACAAAATTCTAAGgtatactgtgtaactgaataatagctgacaaataatcagATCGAGCCTTGC 

ATCCTTTGCACAAGTATTTATTGACATTACCCACAATTCAAAATTCAATTCTTGCTGCAATATATATAG 

GCTAGGATACTCTATGTGTATGAGAGCACAAGCATACTTAAAAAGGATGAAGGATATATTTATTCCCC 

AAAGAATGTTCATAACGGgtgagtacnattttaaciagaaaagtcanaattaaccttagATCTTTTGAATGTTATTGGAAGAAAA 

ATTTGGAAAAAGTTGGCCGAAATATTAGGATATACGAGTAGGCGTTTCTTGTCCTCTGCAGAAGTCAA 




Figure 52 (cont.) 



ATGgiacgigtcggtctcgagacttcagcaatatigacacatcagGCl 1 I' I n'GTCTTGGAATGAGAGATGGTTTGAAACCCTCTT 
TCAAATATCATCCATGCTTCGAACAGCTAATATACCAATTTCAGTCATTGACTGATCTTATCAAGCCGC 
TAAGACCAG I I I 1 GCG AC AG GTGTTA1 J'l'J I'ACATAGAAGAATAGCTGATTAAtgtcatmcaatnattatatacatcctt 
tanactggtgtcnaaacaatananacmgtatagctgacccccaaagcaagcatactataggatnciagtaaagtaaaanaatctcgnariagtmgangacrigtctn 
atccnatacOTaagaaagaugacagtggugctgactactgcccacaigcccat^ 

aataaggaaagtggtmctataatgaa^tgcccgcaciaatgcaaaaagacgaaganatcnctaaacaagggggaaaagcatatccgaaggaaaagagagtaatat 
acccagtgngngaagaaagcaaggataatnggaacaagcttctgcagatgacaggctaaatmggtgaccgaatmggiaaaagccccaggttatccatggtggccg 
gccngctactgagacgaaaagaaactaaggatagmgaatactaatagctcatttaatgtcnatataaggtmgr^^ 

gtgnaagccanarrgganccgaaaugccaaantcnggttcctcaaagcggaagtctaaagaacttangaagcnatgaggcttcaaaaactcctcctgatnaaaggag 
gaaicttccaccgatgaggaaatggauigcttatcagctgctgaggagaagcciaainntgcaaaaaagaaaataicangggagacatctcngatgaaicagatgcgga 
gagiaictccagcggatccugatgtcaataacnctamctgaaatgtatggtcctactgtcgcttcgacrtctcgtagctctacgcagttaagtgaccaaaggtacc 
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EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FFYCTSISST VTIVYFRKDT WN KL1T I' FIVE YFK-TYLVEN 

FFYVTEOQKS YSKTYYYRKN IWDVI-MKMS IAD LKK ETLA — EVQE 

KHKE GSQIFYYRKP IWKLVSKLTI VKVRIQFSEK NKQMKNNFYQ 



FFY.TE. . K. .S..YYYRK. IW. 



-KL. 



F . . K 



.V. . 



40 
43 

50 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



TLSNFNH^t 



NVCRNKNSY- 

KEVEEWKKSL 

KIQLEEENLE KVEEKLIPED SFQKYPQdKp 



-GFAPGKE& 




r.RK 



79 
78 
92 

100 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



AOEESFTIYX EttHKKAIQmQKILE^IlRNK 
IVNSbRKTTK LTTN^<LLNS^^MUa i - 
DKQKNIK--- U^n^II^S^QLVFRJsi<ri- 

. . . . ! .K. . K LN.N. .L . .S^QL.L. JLkS- 



RPTSFTKIYS PTQIADRIKE 129 

RMFK -CPFGFAVFN 120 

ML-G -QXIGYSVFD 130 

-..IG..VF. 150 



EST2 pep 

Euplotes pep 

Trans of tetrahymen 

Consensus 



FKQKL 
YD-D 1 
NK -01 

X- 




NVL -fl 

EFVCKWKQVG 
QFIEKWKNKG F 



PEfc 



. F. .KWK. . G .fe 



YFMKFD VKSCYD 
ATMD "IEKCYD 
IfYVTL- 



fe&T.T.D 



. CYD 



157 
155 
158 

186 
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S-1: FFY VTE TTF QKN RLF FYR KSV WSK 
S-2: RQH LKR VQL RDV SEA EVR QHR EA 
S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 

A-1: AKF LHW LMS VYV VEL LRS FFY VTE TTF Q 
A-2: LFF YRK SVW SKL QSI GIR QHL KRV QLR DVS 
A-3: PAL LTS RLR FIP KPD GLR PIV NMD YVV 
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A 



Vector 



Genomic DNA Insert 



Vector 



A5 



B2 



5525 bp Sequenced 
~2 kb Hind III Fragment 



1 kb 



C3 



B 



RT Motifs 1 2 3(A) 4(B') 5(C) 6(D) 




Introns 



2 3 




6 7 8 9 10 11 



Hind III Xca 



Original PCR 
3' RT-PCR 



RT-PCR w/ M2-B14 
RT-PCR w/ M2-B15 Bajn< 
RT-PCR w/ M2-B15 Bern 
RT-PCR w/ M2-B16 Bajn 



12 13 14 15 



Xca I 



Hind III 



500 bp 
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Poly 4 

t t c 

t a a g c c tcg 
5'- cag acc aaa gga att cca taa gg -3' 
QTKGIPQG 



4(B') 




5 (C) 

M D D Y L L I T 

3'- ctg ccg atg gag gag cag cgg -5' 

aaaaaaaaa 
c t t t 

c c 

Poly 1 



Figure 57 
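
The degenerate primers in this figure are designed by back-translating a conserved peptide. As a rough illustration only (not the applicants' design procedure), the sketch below counts how many distinct nucleotide sequences could encode the motif B' peptide QTKGIPQG under the standard genetic code. The function names are hypothetical, and the table is the standard code written in the conventional TCAG order.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here;
# the figure itself does not state which code table was used).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def codons_for(aa):
    """Return every codon that encodes the one-letter amino acid `aa`."""
    return sorted(c for c, a in CODON_TABLE.items() if a == aa)


def degeneracy(peptide):
    """Number of distinct coding sequences for `peptide` (hypothetical helper)."""
    total = 1
    for aa in peptide:
        total *= len(codons_for(aa))
    return total


if __name__ == "__main__":
    for aa in "QTKGIPQG":
        print(aa, codons_for(aa))
    print("fold degeneracy:", degeneracy("QTKGIPQG"))
```

A full back-translation is usually too degenerate to use directly, which is presumably why the primers shown carry only a few mixed positions; that reading is an inference from the figure, not a statement in the source.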




II 
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PCR Product M2 showed Reasonable Match 
with Other Telomerase Proteins 



Ot LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 

Ea_p123 KGIPQGLCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLIT 

Sp_M2 SILSSFLCHFYMEDLIDEYLSFTKKK GSVLLRVV 

Sc_p103 DGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS QDTLILKLADDFLIIS 



QKVGIPQG 

caa aaa get ggc acc ccc cag gg 

Poly 4 

%j C C C 

£[J a a g c c c c g 

itiag acc aaa gga ace cca caa gg > 

?%g acc aaa gga acc cca cca ggC TCA ATT 

frfcc egg etc ccc caa ggc age ccG AGT TAA 

L. K G : P S G S r 



< Actual Genomic Sequence. 



CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG 
GAC AGT AGA AAA AAC ACA GTA AAG ATA TAC 

LSSFLCHFYM 



GAA GAT TTG ATT 
CTT CTA AAC TAA 

E D L I 



GAT GAA TAC CTA 
CTA CTT ATG GAT 

D E Y L 



TCG TTT ACG AAA 
AGC AAA TGC TTT 

S F T K 



AAG AAA GGA TCA 
TTC TTT CCT AGT 

K K G S 



GTG TTG TTA CGA 
CAC AAC AAT GCT 

V L L R 



GTA GTC gac 
CAT CAG ctg 

V V d 

< CC g 

a 



gac cac ccc 
ctg acg gag 

D Y I* 

ctg acg gag 
a aaa 
c 



ccc acc acc 
gag cag cgg 

l r t 

gag cag cgg 
a a a a 
ccc 
c c 
Poly 1 



gac gat ttc ctc ttt ata aca 
D D F L F I T 



< Actual Genomic Sequence. 
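
The panel above shows a stretch of genomic sequence as two paired strands with the one-letter translation underneath. A minimal sketch of that relationship, assuming the standard genetic code and using the first ten codons shown (CTG TCA ... ATG); the helper names are illustrative, not from the source:

```python
from itertools import product

# Standard genetic code, TCAG ordering (an assumption; see lead-in).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(seq):
    """Base-by-base complement, written in the same left-to-right order
    as the input (this is how the lower strand is drawn in the figure)."""
    return seq.upper().translate(_COMPLEMENT)


def translate(seq):
    """Translate a DNA string (spaces allowed) in frame 1 to one-letter code."""
    seq = seq.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    top = "CTG TCA TCT TTT TTG TGT CAT TTC TAT ATG"
    print(top)
    print(complement(top))
    print(" ".join(translate(top)))
```

Note that the lower strand is the plain complement, not the reverse complement, because the figure keeps both strands aligned position by position.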
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3' RT PCR Strategy 



mRNA 



I AAAAAAAAAAAAAAAA 







1. Synthesis of cDNA with QT Primer. 
mRNA 



5' 

3' 



2. First Round PCR Using Outside Primer and QO Primer. 



I AAAAAAAAAAAAAAAA 



3. Second Round PCR Using Inside Primer and QI Primer. 



4. Sequence Second Round PCR Products Using Inside Primer or Primer. 
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-Size Selected Libraries from P. Nurse 
3 - 4 kb 
5 - 6 kb 
7 - 8 kb 
11 - 12 kb 

-Libraries from J. A. Wise 
Sau 3a Partial Digest 
Hind III Partial Digest 



cDNA Libraries 



GAD (Gal Activation Domain) Library 
REP Library from R. Allshire 
REP81ES Library (old) 
REP81ES Library (new) 
REP41ES Library 



B 



L Original PCR 
3' RT-PCR 



4 f 



500 bp 



P. Nurse 



D 



Mg Conc. 




603- 

310- 
234- 
194- 
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# 
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5' RT PCR Strategy 



5' 



mRNA 



I AAAAAAAAAAAAAAAA 



ddA- 



-P 



1. Synthesis of cDNA with Specific Downstream Primer. 

mRNA 



5' 
3' 



I AAAAAAAAAAAAAAAA 






2. Ligate Oligo with 5'-P and blocked 3' to cDNA using T4 RNA Ligase. 



3. First Round PCR 



4. Second Round PCR 
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Alignment of RT Domains from Telomerase Catalytic Subunits. 

Motif 0 

S.p. Tez1p (429). WLYNSFIIPILQSFFYITESSDLRNRTVYFRKDIW ...(35)... 

S.c. Est2p (366). WLFRQLIPKIIQTFFYCTEISSTVT-IVYFRHDTW ...(35)... 

E.a. p123 (441). WIFEDLVVSLIRCFFYVTEQQKSYSKTYYYRKNIW ...(35)... 

* ***** * * * 

Motif 1 Motif 2 K 

p nh h K hR h R 

S.p. Tez1p AVIRLLPKK--NTFRLITN-LRKRF ...(61)... 

S.c. Est2p SKMRIIPKKSNNEFRIIAIPCRGAD ...(62)... 

E.a. p123 GKLRLIPKK--TTFRPIMTFNKKIV ...(61)... 
* * * * * * 

Motif 3(A) AF 

h hDh GY h 

S.p. Tez1p KKYFVRIDIKSCYDRIKQDLMFRIVK ...(89)... 

S.c. Est2p ELYFMKFDVKSCYDSIPRMECMRILK ...(75)... 

E.a. p123 KLFFATMDIEKCYDSVNREKLSTFLK ...(107)... 
**-*** * 

Motif 4(B') 

hPQG pP hh h 

S.p. Tez1p YLQKVGIPQGSILSSFLCHFYMEDLIDEYLSF ...(6)... 

S.c. Est2p YIREDGLFQGSSLSAPIVDLVYDDLLEFYSEF ...(8)... 

E.a. p123 YKQTKGIPQGLCVSSILSSFYYATLEESSLGF ...(14)... 
***** * * 

Y Motif 5(C) Motif 6(D) 

h F DDhhh Gl; h cK h 

S.p. Tez1p VLLRVVDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENS .(205) 
S.c. Est2p LILKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDKILAVSSQS - (173) 
E.a. p123 LLMRLTDDYLLITTQENNAVLFIEKLINVSRENGFKFNMKKLQTSFPLS - (209) 
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Figure 65 

Disruption strategy for the putative telomerase genes. 
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1. Transform with linear fragment 
containing the telomerase gene 
disrupted with a LEU2 or ura4 marker. 
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2. Assay in selective media. 

3. Sporulate, and grow on 
selective media. 
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(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 
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An Example of Confirmation of tez1 Disruption by PCR 
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Tez1 disruption causes progressive 
shortening of telomeres in S. pombe 
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Figure 68 



                      1
                      met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10                                      20
arg ser phe phe tyr val thr glu thr thr phe gln lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

                    30
leu phe phe tyr arg lys ser val trp ser lys leu gln ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40                                      50
gly ile arg gln his leu lys arg val gln leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

                    60
glu ala glu val arg gln his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70                                      80
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

                    90
ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100                                     110
glu lys arg ala glu arg leu thr ser arg val lys ala leu phe 
GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

                    120
ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130                                     140
ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

                    150
phe val leu arg val arg ala gln asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160                                     170
phe val lys val asp val thr gly ala tyr asp thr ile pro gln 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

                    180
asp arg leu thr glu val ile ala ser ile ile lys pro gln asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 



Figure 68 (cont.) 



190                                     200
thr tyr cys val arg arg tyr ala val val gln lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 

                    210
gly thr ser ala arg pro ser arg ala thr ser tyr val gln cys 
GGC ACG TCC GCA AGG CCT TCA AGA GCC ACG TCC TAC GTC CAG TGC 

220                                     230
gln gly ile pro gln gly ser ile leu ser thr leu leu cys ser 
CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC TGC AGC 

                    240
leu cys tyr gly asp met glu asn lys leu phe ala gly ile arg 
CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG ATT CGG 

250                                     260
arg asp gly leu leu leu arg leu val asp asp phe leu leu val 
CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG TTG GTG 

                    270
thr pro his leu thr his ala lys thr phe leu arg thr leu val 
ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC CTG GTC 

280                                     290
arg gly val pro glu tyr gly cys val val asn leu arg lys thr 
CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG AAG ACA 

                    300
val val asn phe pro val glu asp glu ala leu gly gly thr ala 
GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC ACG GCT 

310                                     320
phe val gln met pro ala his gly leu phe pro trp cys gly leu 
TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC GGC CTG 

                    330
leu leu asp thr arg thr leu glu val gln ser asp tyr ser ser 
CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC TCC AGC 

340                                     350
tyr ala arg thr ser ile arg ala ser leu thr phe asn arg gly 
TAT GCC CGG ACC TCC ATC AGA GCC AGT CTC ACC TTC AAC CGC GGC 

                    360
phe lys ala gly arg asn met arg arg lys leu phe gly val leu 
TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG GTC TTG 

370                                     380
arg leu lys cys his ser leu phe leu asp leu gln val asn ser 
CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG AAC AGC 
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                    390
leu gln thr val cys thr asn ile tyr lys ile leu leu leu gln 
CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG CTG CAG 

400                                     410
ala tyr arg phe his ala cys val leu gln leu pro phe his gln 
GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT CAT CAG 

                    420
gln val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 

430                                     440
arg leu pro leu leu leu his pro glu ser gln glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

                    450
val ala gly gly gln gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460                                     470
arg ala val ala val pro pro ser ile pro ala gln ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

                    480
thr pro cys his leu arg ala thr pro gly val thr gln asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490                                     500
pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

                    510
pro gly gly arg ser gln pro gly thr ala leu arg leu gln asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520                                     530
his pro gly leu met ala thr arg pro gln pro gly arg glu gln 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

                    540
thr pro ala ala leu ser arg arg ala tyr thr ser gln gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550                                     560             564
arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 
CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 
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AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 
CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 
TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 
AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 
GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 
TTT C AGTTTTGG AAAAAAAAAAAAAAAAAAAAAAAAAA 
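
Figure 68 pairs each codon with a three-letter amino acid abbreviation and marks the terminating TGA codon as "OP" (opal). The sketch below reproduces that layout for a short stretch, assuming the standard genetic code. The function name and the "OC" and "AM" labels for the TAA and TAG stops are conventional choices made for this illustration, not taken from the figure.

```python
from itertools import product

# Standard genetic code, TCAG ordering (assumed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

THREE_LETTER = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
}

# Historical names for the three stop codons.
STOP_NAMES = {"TGA": "OP", "TAA": "OC", "TAG": "AM"}


def translate_three(seq):
    """Translate frame 1 of `seq` into space-separated three-letter codes."""
    seq = seq.replace(" ", "").upper()
    words = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon in STOP_NAMES:
            words.append(STOP_NAMES[codon])
        else:
            words.append(THREE_LETTER[CODON_TABLE[codon]])
    return " ".join(words)


if __name__ == "__main__":
    # First coding row and last coding row of the figure.
    print(translate_three("ATG AGT GTG TAC GTC GTC GAG CTG CTC"))
    print(translate_three("TGG GAG TCT GAG GCC TGA"))
```

Running a check like this over each row is one way to spot transcription slips, since an unexpected stop label in the middle of the reading frame usually signals a misread base.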
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Motif -1 

Ep p123 ...LVVSLIRCFFYVTEQQKSYSKT... 

Sp Tez1 ...FIIPILQSFFYITESSDLRNRT... 

Sc Est2 ...LIPKIIQTFFYCTEISSTVTIV... 

Hs TCP1 ...YVVELLRSFFYVTETTFQKNRL... 
consensus FFY TE 

K 

Motif 0 phhh K hRh R 

Ep p123 ...KSLGFAPGKLRLIPKKT--TFRPIMTFNKKIV... 

Sp Tez1 ...QKTTLPPAVIRLLPKKN--TFRLITNLRKRFL... 

Sc Est2 ...TLSNFNHSKMRIIPKKSNNEFRIIAIPCRGAD... 

Hs TCP1 ...ARPALLTSRLRFIPKPD--GLRPIVNMDYVVG... 
consensus R PK R I 

AF 

Motif A h hDh GY h 

Ep p123 ...PKLFFATMDIEKCYDSVNREKLSTFLK... 

Sp Tez1 ...RKKYFVRIDIKSCYDRIKQDLMFRIVK... 

Sc Est2 ...PELYFMKFDVKSCYDSIPRMECMRILK... 

Hs TCP1 ...PELYFVKVDVTGAYDTIPQDRLTEVIA...//... 
consensus F D YD 

Motif B hPQG pS hh 

Ep p123 ...NGKPYKQTKGIPQGLCVSSILSSFYYA... 

Sp Tez1 ...GNSQYLQKVGIPQGSILSSFLCHFYME... 

Sc Est2 ...EDKCYIREDGLFQGSSLSAPIVDLVYD... 

Hs TCP1 ...RATSYVQCQGIPQGSILSTLLCSLCYG... 
consensus G QG S 

Y 

Motif C h FDDhhh 

Ep p123 ...PNVNLLMRLTDDYLLITTQENN... 

Sp Tez1 ...KKGSVLLRVVDDFLFITVNKKD... 

Sc Est2 ...SQDTLILKLADDFLIISTDQQQ... 

Hs TCP1 ...RRDGLLLRLVDDFLLVTPHLTH... 
consensus DD L 

Motif D Ghh cK 

Ep p123 ...NVSRENGFKFNMKKL... 

Sp Tez1 ...LNLSLRGFEKHNFST... 

Sc Est2 ...KKLAMGGFQKYNAKA... 

Hs TCP1 ...LRTLVRGVPEYGCVV... 
consensus G 
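
Each "consensus" row above lists the residues shared by all four aligned proteins. A minimal sketch of that rule, applied to the Motif C rows; the function name is hypothetical, and "identical in every sequence" is assumed as the criterion because it matches the "DD L" consensus shown:

```python
def consensus(aligned, gap="."):
    """Return the residue for each column where all rows agree, else `gap`."""
    if len({len(row) for row in aligned}) != 1:
        raise ValueError("aligned sequences must all have the same length")
    return "".join(
        column[0] if len(set(column)) == 1 else gap
        for column in zip(*aligned)
    )


# Motif C rows as printed in the alignment (flanking dots removed).
MOTIF_C = {
    "Ep p123": "PNVNLLMRLTDDYLLITTQENN",
    "Sp Tez1": "KKGSVLLRVVDDFLFITVNKKD",
    "Sc Est2": "SQDTLILKLADDFLIISTDQQQ",
    "Hs TCP1": "RRDGLLLRLVDDFLLVTPHLTH",
}

if __name__ == "__main__":
    for name, seq in MOTIF_C.items():
        print(f"{name:10s}{seq}")
    print(f"{'consensus':10s}{consensus(list(MOTIF_C.values()))}")
```

Only the two aspartates and the following leucine survive in every row, in line with the DD signature that is characteristic of reverse transcriptase motif C.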
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BSPH1 6978 
APAL1 6310 



DRA3 225 

XHO1 674 
ECOR5 701 
ECOR1 707 
NOT1 713 
MSC1 852 



BSPH1 5970 



APAL1 5564 



ECOR1 4798 
SAC2 4779 




SAC2 4844 
NOT1 4! 
BAMH1 
SMA1 



SMA1 1430 
SMA1 1550 



SMA1 2053 



MSC1 2397 



XHO1 2740 
ECOR5 2828 



MSC1 4174 
SMA1 4095 

APAL1 3766 

SAC2 3658 



DRA3 3221 
BAMH1 3264 
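
The map above lists restriction enzyme names with their coordinates on the construct. As an illustration of how such positions can be located in a known sequence, the sketch below scans for a few recognition sites. The site strings are the well-known ones for these enzymes; the helper name, the 1-based coordinate convention, and the reading of ECOR5 as EcoRV are assumptions. The two test fragments are short stretches that appear in the cDNA listing, and their positions here are relative to the fragment, not to the map.

```python
# Recognition sequences for a subset of the enzymes named on the map.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "EcoRV": "GATATC",
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "SmaI": "CCCGGG",
}


def find_sites(seq, enzymes=RECOGNITION_SITES):
    """Return {enzyme: [1-based start positions]} for every site found."""
    seq = seq.replace(" ", "").upper()
    hits = {}
    for name, site in enzymes.items():
        positions = []
        start = seq.find(site)
        while start != -1:
            positions.append(start + 1)          # report 1-based coordinates
            start = seq.find(site, start + 1)    # allow overlapping matches
        if positions:
            hits[name] = positions
    return hits


if __name__ == "__main__":
    print(find_sites("GTGCCAGGGGATCCCGCAGG"))
    print(find_sites("CTGGGCCTGGACGATATCCACAGG"))
```

All five sites in this sketch are palindromic, so scanning one strand is sufficient; enzymes with ambiguous or asymmetric sites (DraIII on the map, for example) would need pattern matching on both strands.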
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1 GCAGCGCTGC GTCCTGCTGC GCACGTGGGA AGCCCTGGCC CCGGCCACCC 

51 CCGCGATGCC GCGCGCTCCC CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 

101 AGCCACTACC GCGAGGTGCT GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 

151 GCCCCAGGGC TGGCGGCTGG TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 

201 CGNTGGTGGC CCANTGCNTG GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 

251 CCCGCCGCCC CCTCCTTCCG CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 

301 CCGAGTGCTG CANANGCTGT GCGANCGCGG CGCGAANAAC GTGCTGGCCT 

351 TCGGCTTCGC GCTGCTGGAC GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 

401 ACCACCAGCG TGCGCAGCTA CCTGCCCAAC ACGGTGACCG ACGCACTGCG 

451 GGGGAGCGGG GCGTGGGGGC TGCTGCTGCG CCGCGTGGGC GACGACGTGC 

501 TGGTTCACCT GCTGGCACGC TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 

551 TGCGCCTACC ANGTGTGCGG GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 

601 TCAGGCCCGG CCCCCGCCAC ACGCTANTGG ACCCGAANGC GTCTGGGATC 

651 CAACGGGCCT GGAACCATAG CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 

701 CCAGCCCCGG GTGCGAGGAG GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 

751 GTTGCCCAAG AGGCCCAGGC GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 

801 CCGTTGGGCA GGGGTCCTGG GCCCACCCGG GCAGGACGCC TGGACCGAGT 

851 GACCGTGGTT TCTGTGTGGT GTCACCTGCC AGACCCGCCG AAGAAGCCAC 

901 CTCTTTGGAG GGTGCGCTCT CTGGCACGCG CCACTCCCAC CCATCCGTGG 

951 GCCGCCAGCA CCACGCGGGC CCCCCATCCA CATCGCGGCC ACCACGTCCT 

1001 GGGACACGCC TTGTCCCCCG GTGTACGCCG AGACCAAGCA CTTCCTCTAC 

1051 TCCTCAGGCG ACAAGNACAC TGCGNCCCTC CTTCCTACTC AATATATCTG 

1101 AGGCCCAGCC TGACTGGCGT TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 

1151 TTCCAGGCCT TGGATGCCAG GATTCCCCGC AGGTTGCCCC GCCTGCCCCA 

1201 GCGNTACTGG CAAATGCGGC CCCTGTTTCT GGAGCTGCTT GGGAACCACG 

1251 CGCAGTGCCC CTACGGGGTG TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 

1301 GCGGTCACCC CAGCAGCCGG TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 

1351 TGTGGCGGCC CCCGAGGAGG AGGAACACAG ACCCCCGTCG CCTGGTGCAG 

1401 CTGCTCCGCC AGCACAGCAG CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 

1451 CTGCCTGCGC CGGCTGGTGC CCCCAGGCCT CTGGGGCTCC AGGCACAACG 

1501 AACGCCGCTT CCTCAGGAAC ACCAAGAAGT TCATCTCCCT GGGGAAGCAT 

1551 GCCAAGCTCT CGCTGCAGGA GCTGACGTGG AAGATGAGCG TGCGGGACTG 

1601 CGCTTGGCTG CGCAGGAGCC CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 

1651 ACCGTCTGCG TGAGGAGATC CTGGCCAAGT TCCTGCACTG GCTGATGAGT 

1701 GTGTACGTCG TCGAGCTGCT CAGGTCTTTC TTTTATGTCA CGGAGACCAC 

1751 GTTTCAAAAG AACAGGCTCT TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 

1801 TGCAAAGCAT TGGAATCAGA CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 

1851 CTGTCGGAAG CAGAGGTCAG GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 

1901 GACGTCCAGA CTCCGCTTCA TCCCCAAGCC TGACGGGCTG CGGCCGATTG 

1951 TGAACATGGA CTACGTCGTG GGAGCCAGAA CGTTCCGCAG AGAAAAGAGG 

2001 GCCGAGCGTC TCACCTCGAG GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 

2051 CGAGCGGGCG CGGCGCCCCG GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 

2101 ACGATATCCA CAGGGCCTGG CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 

2151 GACCCGCCGC CTGAGCTGTA CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 

2201 CGACACCATC CCCCAGGACA GGCTCACGGA GGTCATCGCC AGCATCATCA 

2251 AACCCCAGAA CACGTACTGC GTGCGTCGGT ATGCCGTGGT CCAGAAGGCC 
2301 GCCCATGGGC ACGTCCGCAA GGCCTTCAAG AGCCACGTCT CTACCTTGAC
2351 AGACCTCCAG CCGTACATGC GACAGTTCGT GGCTCACCTG CAGGANAACA
2401 GCCCGCTGAG GGATGCCGTC GTCATCGAGC AGAGCTCCTC CCTGAATGAG
2451 GCCAGCAGTG GCCTCTTCGA CGTCTTCCTA CGCTTCATGT GCCACCACGC
2501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG
2551 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG
2601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT
2651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC
2701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG
2751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCCC TGGGTGGCAC
2801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC
2851 TGCTGGATAC CCGGACCCTG GAGGTGCAGA GCGACTACTC CAGCTATGCC
2901 CGGACCTCCA TCAGAGCCAG TCTCACCTTC AACCGCGGCT TCAAGGCTGG
2951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA
3001 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC
3051 ATCTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG
3201 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC
3251 CGAGGCCGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC
3301 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GACAGCCCAG
3351 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CCCTGGAGGC
3401 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT
3451 GGCCACCCGC CCACAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC
3501 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCACAC CCAGGCCCGC
3551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT
3601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA
3651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG
3701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA
3751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC
3801 GCCCTGCCCT CCTTTGCCTT CCACCCCCAC CATCCAGGTG GAGACCCTGA
3851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGACCAAAG GTGTGCCCTG
3901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGTCAAATT
3951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT
4001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA

Figure 74

                                                        1
                                                        met
GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG

                                10
pro arg ala pro arg cys arg ala val arg ser leu leu arg ser
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC

            20                                      30
his tyr arg glu val leu pro leu ala thr phe val arg arg leu
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG

                                40
gly pro gln gly trp arg leu val gln arg gly asp pro ala ala
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT

            50                                      60
phe arg ala leu val ala gln cys leu val cys val pro trp asp
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC

                                70
ala arg pro pro pro ala ala pro ser phe arg gln val ser cys
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC

            80                                      90
leu lys glu leu val ala arg val leu gln arg leu cys glu arg
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC

                                100
gly ala lys asn val leu ala phe gly phe ala leu leu asp gly
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG

            110                                     120
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC

                                130
tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG

            140                                     150
trp gly leu leu leu arg arg val gly asp asp val leu val his
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC

                                160
leu leu ala arg cys ala leu phe val leu val ala pro ser cys
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC

            170                                     180
ala tyr gln val cys gly pro pro leu tyr gln leu gly ala ala
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC

                                190
thr gln ala arg pro pro pro his ala ser gly pro arg arg arg
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT

            200                                     210
leu gly cys glu arg ala trp asn his ser val arg glu ala gly
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG

                                220
val pro leu gly leu pro ala pro gly ala arg arg arg gly gly
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC

            230                                     240
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC

                                250
ala ala pro glu pro glu arg thr pro val gly gln gly ser trp
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG

            260                                     270
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT

                                280
val val ser pro ala arg pro ala glu glu ala thr ser leu glu
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG

            290                                     300
gly ala leu ser gly thr arg his ser his pro ser val gly arg
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC

                                310
gln his his ala gly pro pro ser thr ser arg pro pro arg pro
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC

            320                                     330
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC

                                340
leu tyr ser ser gly asp lys glu gln leu arg pro ser phe leu
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA

            350                                     360
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG

                                370
glu thr ile phe leu gly ser arg pro trp met pro gly thr pro
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC

            380                                     390
arg arg leu pro arg leu pro gln arg tyr trp gln met arg pro
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC

                                400
leu phe leu glu leu leu gly asn his ala gln cys pro tyr gly
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG

            410                                     420
val leu leu lys thr his cys pro leu arg ala ala val thr pro
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA

                                430
ala ala gly val cys ala arg glu lys pro gln gly ser val ala
GCA GCC GGT GTC TGT GCC CGG GAG AAG CCC CAG GGC TCT GTG GCG

            440                                     450
ala pro glu glu glu asp thr asp pro arg arg leu val gln leu
GCC CCC GAG GAG GAG GAC ACA GAC CCC CGT CGC CTG GTG CAG CTG

                                460
leu arg gln his ser ser pro trp gln val tyr gly phe val arg
CTC CGC CAG CAC AGC AGC CCC TGG CAG GTG TAC GGC TTC GTG CGG

            470                                     480
ala cys leu arg arg leu val pro pro gly leu trp gly ser arg
GCC TGC CTG CGC CGG CTG GTG CCC CCA GGC CTC TGG GGC TCC AGG

                                490
his asn glu arg arg phe leu arg asn thr lys lys phe ile ser
CAC AAC GAA CGC CGC TTC CTC AGG AAC ACC AAG AAG TTC ATC TCC

            500                                     510
leu gly lys his ala lys leu ser leu gln glu leu thr trp lys
CTG GGG AAG CAT GCC AAG CTC TCG CTG CAG GAG CTG ACG TGG AAG

                                520
met ser val arg asp cys ala trp leu arg arg ser pro gly val
ATG AGC GTG CGG GAC TGC GCT TGG CTG CGC AGG AGC CCA GGG GTT

            530                                     540
gly cys val pro ala ala glu his arg leu arg glu glu ile leu
GGC TGT GTT CCG GCC GCA GAG CAC CGT CTG CGT GAG GAG ATC CTG

                                550
ala lys phe leu his trp leu met ser val tyr val val glu leu
GCC AAG TTC CTG CAC TGG CTG ATG AGT GTG TAC GTC GTC GAG CTG

            560                                     570
leu arg ser phe phe tyr val thr glu thr thr phe gln lys asn
CTC AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC

                                580
arg leu phe phe tyr arg pro ser val trp ser lys leu gln ser
AGG CTC TTT TTC TAC CGG CCG AGT GTC TGG AGC AAG TTG CAA AGC

            590                                     600
ile gly ile arg gln his leu lys arg val gln leu arg glu leu
ATT GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG

                                610
ser glu ala glu val arg gln his arg glu ala arg pro ala leu
TCG GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG

            620                                     630
leu thr ser arg leu arg phe ile pro lys pro asp gly leu arg
CTG ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG

                                640
pro ile val asn met asp tyr val val gly ala arg thr phe arg
CCG ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC

            650                                     660
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG

                                670
phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG

            680                                     690
gly ala ser val leu gly leu asp asp ile his arg ala trp arg
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC

                                700
thr phe val leu arg val arg ala gln asp pro pro pro glu leu
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG

            710                                     720
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC

                                730
gln asp arg leu thr glu val ile ala ser ile ile lys pro gln
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG

            740                                     750
asn thr tyr cys val arg arg tyr ala val val gln lys ala ala
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC

                                760
his gly his val arg lys ala phe lys ser his val ser thr leu
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG

            770                                     780
thr asp leu gln pro tyr met arg gln phe val ala his leu gln
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG

                                790
glu thr ser pro leu arg asp ala val val ile glu gln ser ser
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC

            800                                     810
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC

                                820
phe met cys his his ala val arg ile arg gly lys ser tyr val
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC

            830                                     840
gln cys gln gly ile pro gln gly ser ile leu ser thr leu leu
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC

                                850
cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG

            860                                     870
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG

                                880
leu val thr pro his leu thr his ala lys thr phe leu arg thr
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC

            890                                     900
leu val arg gly val pro glu tyr gly cys val val asn leu arg
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG

                                910
lys thr val val asn phe pro val glu asp glu ala leu gly gly
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC

            920                                     930
thr ala phe val gln met pro ala his gly leu phe pro trp cys
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC

                                940
gly leu leu leu asp thr arg thr leu glu val gln ser asp tyr
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC

            950                                     960
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC

                                970
arg gly phe lys ala gly arg asn met arg arg lys leu phe gly
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG

            980                                     990
val leu arg leu lys cys his ser leu phe leu asp leu gln val
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG

                                1000
asn ser leu gln thr val cys thr asn ile tyr lys ile leu leu
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG

            1010                                    1020
leu gln ala tyr arg phe his ala cys val leu gln leu pro phe
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT

                                1030
his gln gln val trp lys asn pro thr phe phe leu arg val ile
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC

            1040                                    1050
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC

                                1060
ala gly met ser leu gly ala lys gly ala ala gly pro leu pro
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC

            1070                                    1080
ser glu ala val gln trp leu cys his gln ala phe leu leu lys
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG

                                1090
leu thr arg his arg val thr tyr val pro leu leu gly ser leu
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC

            1100                                    1110
arg thr ala gln thr gln leu ser arg lys leu pro gly thr thr
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG

                                1120
leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC

            1130    1132
phe lys thr ile leu asp OP
TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA
CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC
AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC
GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC
CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG
CTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT
CGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA
GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA
CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC
TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAA
AAAAAAAAAA